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Preface to Analysis of Functions of a Single Variable: A Detailed 
Development 

This is the preface to Lawrence Baggett's book, Analysis of Functions of a 
Single Variable, a Detailed Development. It provides a brief overview, 
including links, of the material covered in the book. 


For Christy My Light 


I have written this book primarily for serious and talented mathematics 
scholars, seniors or first-year graduate students, who by the time they finish 
their schooling should have had the opportunity to study in some detail the 
great discoveries of our subject. What did we know and how and when did 
we know it? I hope this book is useful toward that goal, especially when it 
comes to the great achievements of that part of mathematics known as 
analysis. I have tried to write a complete and thorough account of the 
elementary theories of functions of a single real variable and functions of a 
single complex variable. Separating these two subjects does not at all jive 
with their development historically, and to me it seems unnecessary and 
potentially confusing to do so. On the other hand, functions of several 
variables seems to me to be a very different kettle of fish, so I have decided 
to limit this book by concentrating on one variable at a time. 


Everyone is taught (told) in school that the area of a circle is given by the 
formula A = mr?. We are also told that the product of two negatives is a 
positive, that you cant trisect an angle, and that the square root of 2 is 
irrational. Students of natural sciences learn that e’” = —1 and that 

sin? + cos?= 1. More sophisticated students are taught the Fundamental 
Theorem of calculus and the Fundamental Theorem of Algebra. Some are 
also told that it is impossible to solve a general fifth degree polynomial 
equation by radicals. On the other hand, very few people indeed have the 
opportunity to find out precisely why these things are really true, and at the 
same time to realize just how intellectually deep and profound these “facts” 
are. Indeed, we mathematicians believe that these facts are among the most 
marvelous accomplishments of the human mind. Engineers and scientists 
can and do commit such mathematical facts to memory, and quite often 
combine them to useful purposes. However, it is left to us mathematicians 
to share the basic knowledge of why and how, and happily to us this is more 


a privilege than a chore. A large part of what makes the verification of such 
simple sounding and elementary truths so difficult is that we of necessity 
must spend quite a lot of energy determining what the relevant words 
themselves really mean. That is, to be quite careful about studying 
mathematics, we need to ask very basic questions: What is a circle? What 
are numbers? What is the definition of the area of a set in the Euclidean 
plane? What is the precise definition of numbers like 7,2, and e? We surely 
cannot prove that e*” = —1 without a clear definition of these particular 
numbers. The mathematical analysis story is a long one, beginning with the 
early civilizations, and in some sense only coming to a satisfactory 
completion in the late nineteenth century. It is a story of ideas, well worth 
learning. 


There are many many fantastic mathematical truths (facts), and it seems to 
me that some of them are so beautiful and fundamental to human 
intellectual development, that a student who wants to be called a 
mathematician, ought to know how to explain them, or at the very least 
should have known how to explain them at some point. Each professor 
might make up a slightly different list of such truths. Here is mine: 


. The square root of 2 is a real number but is not a rational number. 
. The formula for the area of a circle of radius r is A = mr. 
. The formula for the circumference of a circle of radius r is C = 27r. 
et S-1, 

b 
. The Fundamental Theorem of Calculus, [, f (t) dt = F (b) — F (a). 


. The Fundamental Theorem of Algebra, every nonconstant polynomial 
has at least one root in the complex numbers. 

7. It is impossible to trisect an arbitrary angle using only a compass and 

straight edge. 
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Other mathematical marvels, such as the fact that there are more real 
numbers than there are rationals, the set of all sets is not a set, an arbitrary 
fifth degree polynomial equation can not be solved in terms of radicals, a 
simple closed curve divides the plain into exactly two components, there 
are an infinite number of primes, etc., are clearly wonderful results, but the 
seven in the list above are really of a more primary nature to me, an analyst, 
for they stem from the work of ancient mathematicians and except for 


number 7, which continues to this day to evoke so-called disproofs, have 
been accepted as true by most people even in the absence of precise 
“arguments” for hundreds if not thousands of years. Perhaps one should 
ruminate on why it took so long for us to formulate precise definitions of 
things like numbers and areas? 


Only with the advent of calculus in the seventeenth century, together with 
the contributions of people like Euler, Cauchy, and Weierstrass during the 
next two hundred years, were the first six items above really proved, and 
only with the contributions of Galois in the early nineteenth century was the 
last one truly understood. 


This text, while including a traditional treatment of introductory analysis, 
specifically addresses, as kinds of milestones, the first six of these truths 
and gives careful derivations of them. The seventh, which looks like an 
assertion from geometry, turns out to be an algebraic result that is not 
appropriate for this course in analysis, but in my opinion it should definitely 
be presented in an undergraduate algebra course. As for the first six, I insist 
here on developing precise mathematical definitions of all the relevant 
notions, and moving step by step through their derivations. Specifically, 
what are the definitions of /2, A, 7,7, 7, C, 2, e,i,, and —1? My feeling 
is that mathematicians should understand exactly where these concepts 
come from in precise mathematical terms, why it took so long to discover 
these definitions, and why the various relations among them hold. 


The numbers —1, 2, and z can be disposed of fairly quickly by a discussion 
of what exactly is meant by the real and complex number systems. Of 
course, this is in fact no trivial matter, having had to wait until the end of 
the nineteenth century for a clear explanation, and in fact I leave the actual 
proof of the existence of the real numbers to an appendix. However, a 
complete mathematics education ought to include a study of this proof, and 
if one finds the time in this analysis course, it really should be included 
here. Having a definition of the real numbers to work with, i.e., having 
introduced the notion of least upper bound, one can relatively easily prove 
that there is a real number whose square is 2, and that this number can not 
be a rational number, thereby disposing of the first of our goals. All this is 
done in [link]. Maintaining the attitude that we should not distinguish 


between functions of a real variable and functions of a complex variable, at 
least at the beginning of the development, [link] concludes with a careful 
introduction of the basic properties of the field of complex numbers. 


unlike the elementary numbers —1, 2, and 7, the definitions of the real 
numbers e and 7 are quite a different story. In fact, one cannot make sense 
of either e or 7 until a substantial amount of analysis has been developed, 
for they both are necessarily defined somehow in terms of a limit process. I 
have chosen to define e here as the limit of the rather intriguing sequence 

{ (1 + +) ie? in some ways the first nontrivial example of a convergent 


sequence, and this is presented in [link]. Its relation to logarithms and 
exponentials, whatever they are, has to be postponed to [link]. [link] also 
contains a section on the elementary topological properties (compactness, 
limit points, etc.) of the real and complex numbers as well as a thorough 
development of infinite series. 


To define 7 as the ratio of the circumference of a circle to its diameter is 
attractive, indeed was quite acceptable to Euclid, but is dangerously 
imprecise unless we have at the outset a clear definition of what is meant by 
the length of a curve, e.g., the circumference of a circle. That notion is by 
no means trivial, and in fact it only can be carefully treated in a 
development of analysis well after other concepts. Rather, I have chosen to 
define 7 here as the smallest positive zero of the sine function. Of course, I 
have to define the sine function first, and this is itself quite deep. I do it 
using power series functions, choosing to avoid the common definition of 
the trigonometric functions in terms of “ wrapping” the real line around a 
circle, for that notion again requires a precise definition of arc length before 
it would make sense. I get to arc length eventually, but not until [link]. 


In [link] I introduce power series functions as generalizations of 
polynomials, specifically the three power series functions that turn out to be 
the exponential, sine, and cosine functions. From these definitions it follows 
directly that exp 7z =cos z +72 sin z for every complex number z. Here is 
a place where allowing the variable to be complex is critical, and it has cost 
us nothing. However, even after establishing that there is in fact a smallest 
positive zero of the sine function (which we decide to call 7, since we know 
how we want things to work out), one cannot at this point deduce that 


cos 7 = —1, so that the equality e’** = —1 also has to wait for its 
derivation until [link]. In fact, more serious, we have no knowledge at all at 
this point of the function e* for a complex exponent z. What does it mean to 
raise a real number, or even an integer, to a complex exponent? The very 
definition of such a function has to wait. 


[link] also contains all the standard theorems about continuous functions, 
culminating with a lengthy section on uniform convergence, and finally 
Abel's fantastic theorem on the continuity of a power series function on the 
boundary of its disk of convergence. 


The fourth chapter begins with all the usual theorems from calculus, Mean 
Value Theorem, Chain Rule, First Derivative Test, and so on. Power series 
functions are shown to be differentiable, from which the law of exponents 
emerges for the power series function exp. Immediately then, all of the 
trigonometric and exponential identities are also derived. We observe that 
e’ =exp (r) for every rational number r, and we at last can define 
consistently e* to be the value of the power series function exp (z) for any 
complex number z. From that, we establish the equation e’” = —1. Careful 
proofs of Taylor's Remainder Theorem and L'Hopital's Rule are given, as 
well as an initial approach to the general Binomial Theorem for non-integer 
exponents. 


It is in [link] that the first glimpse of a difference between functions of a 
real variable and functions of a complex variable emerges. For example, 
one of the results in this chapter is that every differentiable, real-valued 
function of a complex variable must be a constant function, something that 
is certainly not true for functions of a real variable. At the end of this 
chapter, I briefly slip into the realm of real-valued functions of two real 
variables. I introduce the definition of differentiability of such a function of 
two real variables, and then derive the initial relationships among the partial 
derivatives of such a function and the derivative of that function thought of 
as a function of a complex variable. This is obviously done in preparation 
for Chapter VII where holomorphic functions are central. 


Perhaps most well-understood by math majors is that computing the area 
under a curve requires Newton's calculus, i.e., integration theory. What is 


often overlooked by students is that the very definition of the concept of 
area is intimately tied up with this integration theory. My treatment here of 
integration differs from most others in that the class of functions defined as 
integrable are those that are uniform limits of step functions. This is a 
smaller collection of functions than those that are Riemann-integrable, but 
they suffice for my purposes, and this approach serves to emphasize the 
importance of uniform convergence. In particular, I include careful proofs 
of the Fundamental Theorem of Calculus, the integration by substitution 
theorem, the integral form of Taylor's Remainder Theorem, and the 
complete proof of the general Binomial Theorem. 


Not wishing to delve into the set-theoretic complications of measure theory, 
I have chosen only to define the area for certain “geometric” subsets of the 
plane. These are those subsets bounded above and below by graphs of 
continuous functions. Of course these suffice for most purposes, and in 
particular circles are examples of such geometric sets, so that the formula 

A = mr? can be established for the area of a circle of radius r. [link] 
concludes with a development of integration over geometric subsets of the 
plane. Once again, anticipating later needs, we have again strayed into some 
investigations of functions of two real variables. 


Having developed the notions of arc length in the early part of [link], 
including the derivation of the formula for the circumference of a circle, I 
introduce the idea of a contour integral, i.e., integrating a function around a 
curve in the complex plane. The Fundamental Theorem of Calculus has 
generalizations to higher dimensions, and it becomes Green's Theorem in 2 
dimensions. I give a careful proof in [link], just over geometric sets, of this 
rather complicated theorem. 


Perhaps the main application of Green's Theorem is the Cauchy Integral 
Theorem, a result about complex-valued functions of a complex variable, 
that is often called the Fundamental Theorem of Analysis. I prove this 
theorem in [link]. From this Cauchy theorem one can deduce the usual 
marvelous theorems of a first course in complex variables, e.g., the Identity 
Theorem, Liouville's Theorem, the Maximum Modulus Principle, the Open 
Mapping Theorem, the Residue Theorem, and last but not least our 
mathematical truth number 6, the Fundamental Theorem of Algebra. That 


so much mathematical analysis is used to prove the fundamental theorem of 
algebra does make me smile. I will leave it to my algebraist colleagues to 
point out how some of the fundamental results in analysis require 
substantial algebraic arguments. 


The overriding philosophical point of this book is that many analytic 
assertions in mathematics are intellectually very deep; they require years of 
study for most people to understand; they demonstrate how intricate 
mathematical thought is and how far it has come over the years. Graduates 
in mathematics should be proud of the degree they have earned, and they 
should be proud of the depth of their understanding and the extremes to 
which they have pushed their own intellect. I love teaching these students, 
that is to say, I love sharing this marvelous material with them. 


Definition of the Numbers 1, i, and the square root of 2 

The overview of the main points of the chapter on real and complex 
numbers: least upper bound, greatest lower bound, real numbers, geometric 
progression, the triangle inequality, and the binomial theorem. 


In order to make precise sense out of the concepts we study in mathematical 
analysis, we must first come to terms with what the "real numbers" are. 
Everything in mathematical analysis is based on these numbers, and their 
very definition and existence is quite deep. We will, in fact, not attempt to 
demonstrate (prove) the existence of the real numbers in the body of this 
text, but will content ourselves with a careful delineation of their properties, 
referring the interested reader to an appendix for the existence and 
uniqueness proofs. 


Although people may always have had an intuitive idea of what these real 
numbers were, it was not until the nineteenth century that mathematically 
precise definitions were given. The history of how mathematicians came to 
realize the necessity for such precision in their definitions is fascinating 
from a philosophical point of view as much as from a mathematical one. 
However, we will not pursue the philosophical aspects of the subject in this 
book, but will be content to concentrate our attention just on the 
mathematical facts. These precise definitions are quite complicated, but the 
powerful possibilities within mathematical analysis rely heavily on this 
precision, sO we must pursue them. Toward our primary goals, we will in 


this chapter give definitions of the symbols (numbers) and 
The main points of this chapter are the following: 


1. The notions of least upper bound (supremum) and greatest lower 
bound (infimum) of a set of numbers, 

2. The definition of the real numbers 

3. the formula for the sum of a geometric progression ([link]), 

4. the Binomial Theorem ((link]), and 

5. the triangle inequality for complex numbers ((link]). 


The Natural Numbers and the Integers 

Algebraic relations include commutativity, associativity, and distributivity. 
The axiom of mathematical induction is stated and employed as a method 
of proof. A generalized version of the axiom is also mentioned, along with 
an explanation of negative numbers and integers. 


We will take for granted that we understand the existence of what we call 
the natural numbers, i.e., the set NV whose elements are the numbers 

1,2, 3,4, .... Indeed, the two salient properties of this set are that (a) there is 
a frist element (the natural number 1), and (b) for each element n of this set 
there is a “very next” one, i.e., an immediate successor. We assume that the 
algebraic notions of sum and product of natural numbers is well-defined 
and familiar. These operations satisfy three basic relations: 


Basic Algebraic Relations. 


1. (Commutativity)n +m =m-+nandn x m=m x n for all 
n,meN. 

2. (Associativity) n + (m+ k) = (n+m) +k and 
nx (mx k)=(nxm) x kforalln,m,k € N. 

3. (Distributivity) n x (m+k)=nxm+nxkforalln,m,k € N. 


We also take as given the notion of one natural number being larger than 
another one. 2 > 1,5 > 3,n + 1 > n, etc. We will accept as true the axiom 
of mathematical induction, that is, the following statement: 


AXIOM OF MATHEMATICAL INDUCTION. Let S be a subset of the 
set N of natural numbers. Suppose that 


de 1G. Ss 
2. If a natural number k is in S, then the natural number k + 1 also is in 
Rye 
Then S = N. 


That is, every natural number n belongs to S. 


REMARK The axiom of mathematical induction is for our purposes 
frequently employed as a method of proof. That is, if we wish to show that 
a certain proposition holds for all natural numbers, then we let S denote the 
set of numbers for which the proposition is true, and then, using the axiom 
of mathematical induction, we verify that S is all of N by showing that S 
satisfies both of the above conditions. Mathematical induction can also be 
used as a method of definition. That is, using it, we can define an infinite 
number of objects {O,,} that are indexed by the natural numbers. Think of 
S as the set of natural numbers for which the object O, is defined. We 
check first to see that the object O is defined. We check next that, if the 
object O; is defined for a natural number k, then there is a prescribed 
procedure for defining the object Oz+1. So, by the axiom of mathematical 
induction, the object is defined for all natural numbers. This method of 
defining an infinite set of objects is often referred to as sl recursive 
definition, or definition by recursion. 


As an example of recursive definition, let us carefully define 
exponentiation. 


Let a be a natural number. We define inductively natural numbers a” 
as follows: a! = a, and, whenever at is defined, then a**! is defined 
to bea x a*. 


The set S of all natural numbers for which a” is defined is therefore all of 
N. For, a! is defined, and if a” is defined there is a prescription for defining 
a*+1, This “careful” definition of a” may seem unnecessarily detailed. Why 
not simply define a” asa X a X @ X a... X an times? The answer is that 
the ..., though suggestive enough, is just not mathematically precise. After 
all, how would you explain what ... means? The answer to that is that you 
invent a recursive definition to make the intuitive meaning of the ... 
mathematically precise. We will of course use the symbol ... to simplify and 
shorten our notation, but keep in mind that, if pressed, we should be able to 
provide a careful definition. 

Exercise: 


Problem: 


a. Derive the three laws of exponents for the natural numbers: 
a't™ — q” x a’. HINT: Fix a and m and use the axiom of 
mathematical induction. a”*” = (a™)". HINT: Fix a and m and 
use the axiom of mathematical induction. (a x b)” = a” x 6". 
HINT: Fix a and 6 and use the axiom of mathematical induction. 

b. Define inductively numbers {S;} as follows: S; = 1, and if S; is 
defined, then S;,,1 is defined to be S; + k + 1. Prove, by 
induction, that S, = n(n + 1)/2. Note that we could have 
defined S,, using the ... notation by S, =1+2+3-+4...+n. 

c. Prove that 
Equation: 


fd aco eit wie a ee) 
see | 


d. Make a recursive definition of n! = 1x2x3x... x n.nl is 
called n factorial. 


There is a slightly more general statement of the axiom of mathematical 
induction, which is sometimes of use. 


GENERAL AXIOM OF MATHEMATICAL INDUCTION Let S be a 
subset of the set N of natural numbers, and suppose that S satisfies the 
following conditions 


1. There exists a natural number Ko such that ko € S. 
2. If S contains a natural number k, then S contains the natural number 
aa 


Then S contains every natural number n that is larger than or equal to ko. 


From the fundamental set V of natural numbers, we construct the set Z of 
all integers. First, we simply create an additional number called 0 that 
satisfies the equations 0 + n = n for alln € N andO x n = 0 for all 

nm € N. The word “create” is, for some mathematicians, a little unsettling. 
In fact, the idea of zero did not appear in mathematics until around the year 


900. It is easy to see how the so-called natural numbers came by their name. 
Fingers, toes, trees, fish, etc., can all be counted, and the very concept of 
counting is what the natural numbers are about. On the other hand, one 
never needed to count zero fingers or fish, so that the notion of zero as a 
number easily could have only come into mathematics at a later time, a time 
when arithmetic was becoming more sophisticated. In any case, from our 
twenty-first century viewpoint, 0 seems very understandable, and we won't 
belabor the fundamental question of its existence any further here. 


Next, we introduce the so-called negative numbers. This is again quite 
reasonable from our point of view. For every natural number n, we let —n 
be a number which, when added to n, give 0. Again, the question of 
whether or not such negative numbers exist will not concern us here. We 
simply create them. 


In short, we will take as given the existence of a set Z, called the integers, 
which comprises the set V of natural numbers, the additional number 0, 
and the set —N of all negative numbers. We assume that addition and 
multiplication of integers satisfy the three basic algebraic relations of 
commutativity, associativity, and distributivity stated above. We also 
assume that the following additional relations hold: 

Equation: 


(—n) x (—k) =nxk, and (—n) x k=nx (—k) = —(n x k) 


for all natural numbers n and k. 


The Rational Numbers 
Rational numbers are thought of as quotients k/n of integers. Fields are 
defined, and examples of fields are given. 


Next, we discuss the set Q of rational numbers, which we ordinarily think 
of as quotients k/n of integers. Of course, we do not allow the “second” 
element n of the quotient k/n to be 0. Also, we must remember that there 
isn't a 1-1 correspondence between the set Q of all rational numbers and the 
set of all such quotients k/n. Indeed, the two distinct quotients 2/3 and 6/9 
represent the same rational number. To be precise, the set Q is a collection 
of equivalence classes of ordered pairs (k, n) of integers, for which the 
second component of the pair is not 0. The equivalence relation among 
these ordered pairs is this: 

Equation: 


(k,n) = (k,n') if kxn’=nx K. 


We will not dwell on this possibly subtle definition, but will rather accept 
the usual understanding of the rational numbers and their arithmetic 
properties. In particular, we will represent them as quotients rather than as 
ordered pairs, and, if r is a rational number, we will write r = k/n, instead 
of writing r as the equivalence class containing the ordered pair (k,n). As 
usual, we refer to the first integer in the quotient k/n as the numerator and 
the second (nonzero) integer in the quotient k/n as the denominator of the 
quotient. The familiar definitions of sum and product for rational numbers 
are these: 


Equation: 
k k! 7 kn! + nk! 
n no nn! 
and 
Equation: 
k k! kk! 
a . 
n n! nn! 


Addition and multiplication of rational numbers satisfy the three basic 
algebraic relations of commutativity, associativity and distributivity stated 


earlier. 


We note that the integers Z can be identified in an obvious way as a subset 
of the rational numbers Q. Indeed, we identify the integer k with the 
quotient k/1. In this way, we note that Q contains the two numbers 

0 = 0/1 and 1 = 1/1. Notice that any other quotient k/n that is equivalent 
to 0/1 must satisfy & = 0, and any other quotient k/n that is equivalent to 
1/1 must satisfy k = n. Remember, k/n = k'/n’ if and only if kn’ = k’n. 


The set @ has an additional property not shared by the set of integers Z. It 
is this: For each nonzero element r € Q, there exists an element r’ € Q for 
which r x r’ = 1. Indeed, if r = k/n # 0, then k ¥ 0, and we may define 
r’ = n/k. Consequently, the set Q of all rational numbers is what is known 
in mathematics as a field. 


A field is anonempty set F’ on which there are defined two binary 
operations, addition (+) and multiplication (x), such that the 
following six axioms hold: 


oF 


2 


Both addition and multiplication are commutative and 
associative. 


. Multiplication is distributive over addition; i.e., 


Equation: 
ex(ytz)="2xytuxz 


forall 2, y,2 € F. 


. There exists an element in F’, which we will denote by 0, that is 


an identity for addition; i.e., x + 0 = x forallz € F. 


. There exists a nonzero element in F’, which we will denote by 1, 


that is an identity for multiplication; i.e., x x 1 = = for all 
et. 


. If x € F, then there exists a unique element y € F’ such that 


x + y = 0. This element y is called the additive inverse of x and 


is denoted by —z. 

6. If « € F and x ¥ 0, then there exists a unique element y € F 
such that x x y = 1. This element y is called the multiplicative 
inverse of x and is denoted by «?. 


REMARK. There are many examples of fields. (See [link].) They all share 
certain arithmetic properties, which can be derived from the axioms above. 
If z is an element of a field F’, then according to one of the axioms above, 
we have that 1 x x = a. (Note that this “1” is the multiplicative identity of 
the field F’ and not the natural number 1.) However, it is tempting to write 
x+a= 2x az inthe field Ff’. The “2” here is not a priori an element of F’, 
so that the equation x + x = 2 x z is not really justified. This is an 
example of a situation where a careful recursive definition can be useful. 


If z is an element of a field F’, define inductively elements n - x = nz 


of F by 1-2 =a, and, if k- x is defined, set (kK+1)-cr=a2+k-za. 
The set S of all natural numbers n for which n - z is defined is 
therefore, by the axiom of mathematical induction, all of NV. 


Usually we will write nx instead of n - x. Of course, nz is just the element 
of F' obtained by adding z to itselfn times:nza =x+a+a2+4+...42. 
Exercise: 


Problem: 


a. 


b. 


Justify for yourself that the set Q of all rational numbers is a 
field. That is, carefully verify that all six of the axioms hold. 
Let F denote the seven elements {0, 1, 2, 3,4,5, 6}. Define 
addition and multiplication on F7 as ordinary addition and 
multiplication mod 7. Prove that F7 is a field. (You may assume 
that axioms (1) and (2) hold. Check only conditions (3)-(6).) 
Show in addition that 7z = 0 for every x € Fr. 


. Let Fg denote the set consisting of the nine elements 


{0, 1, 2, 3, 4,5, 6, 7,8}. Define addition and multiplication on F9 
to be ordinary addition and multiplication mod 9. Show that F% is 
not a field. Which of the axioms fail to hold? 


. Show that the set NV of natural numbers is not a field. Which of 


the field axioms fail to hold? Show that the set Z of all integers is 


not a field. Which of the field axioms fail to hold? 


Exercise: 


Problem: 


Let F' be any field. Verify that the following arithmetic properties hold 
in F’. 


a. 0 x « = 0 for all z € F. HINT: Use the distributive law and the 
fact thatO = 0+ 0. 

b. If x and y are nonzero elements of F’, then x x y is nonzero. And, 
the multiplicative inverse of x x y satisfies 
(exy)t=atxyth 

c. (—1) x « = (-2) forall a € F. 

d.(—2) x (-y) = 2x yforallz,y € F. 

erxx-—yxy=(e£-y) x (a+ y). 

fi.(ety)x(e@+y)=xa2xa4+2-axytyxy. 


Let F’ be a field, and let x be a nonzero element of F’. 


For each natural number n, we define inductively an element x” in F’ 
as follows: x! = x, and, if zx* is defined, set x*+! = x x x*. Of 
course, x” is just the product of nz's. 


Define z° to be 1. 


For each natural number n, define x~” to be the multiplicative inverse 
(x”)~' of the element 2”. 


Finally, we define 0” to be 0 for every positive integer m, and we 
leave 0~” and 0° undefined. 


We have therefore defined x” for every nonzero x and every integer 
ME Z. 
Exercise: 


Problem: Let F' be a field. Derive the following laws of exponents: 


a. xt™ = x” x a for all nonzero elements x € F and all integers 


n and m. HINT: Fix x € F'andm € Z and use induction to 
derive this law for all natural numbers n. Then use the fact that in 
any field(z x y) -=a txyt 

b. 2”*™ = (x2™)" for all nonzero x € F' andalln,m € Z. 

c. (x x y)" = 2" x y” for all nonzero x,y € F andalln € Z. 


From now on, we will indicate multiplication in a field by juxtaposition; 
Le., x x y will be denoted simply as xy. Also, we will use the standard 
fractional notation to indicate multiplicative inverses. For instance, 
Equation: 


The Real Numbers 

Algebraic and positivity probabilities of real numbers. Ordered fields, 
isomorphism, bounded below, bounded above, supremum, infimum, and 
complete are defined. 


What are the real numbers? From a geometric point of view (and a 
historical one as well) real numbers are quantities, i.e., lengths of segments, 
areas of surfaces, volumes of solids, etc. For example, once we have settled 
on a unit of length, i.e., a segment whose length we call 1, we can, using a 
compass and straightedge, construct segments of any rational length k/n. 
In some obvious sense then, the rational numbers are real numbers. 
Apparently it was an intellectual shock to the Pythagoreans to discover that 
there are some other real numbers, the so-called irrational ones. Indeed, the 
square root of 2 is a real number, since we can construct a segment the 
square of whose length is 2 by making a right triangle each of whose legs 
has length 1. (By the Pythagorean Theorem of plane geometry, the square 
of the hypotenuse of this triangle must equal 2.) And, Pythagoras proved 
that there is no rational number whose square is 2, thereby establishing that 
there are real numbers tha are not rational. See part (c) of [link]. 


Similarly, the area of a circle of radius 1 should be a real number; i.e., 7 
should be a real number. It wasn't until the late 1800's that Hermite showed 
that 7 is not a rational number. One difficulty is that to define 7 as the area 
of a circle of radius 1 we must first define what is meant by the “ area" of a 
circle, and this turns out to be no easy task. In fact, this naive, geometric 
approach to the definition of the real numbers turns out to be unsatisfactory 
in the sense that we are not able to prove or derive from these first 
principles certain intuitively obvious arithmetic results. For instance, how 
can we multiply or divide an area by a volume? How can we construct a 
segment of length the cube root of 2? And, what about negative numbers? 


Let us begin by presenting two properties we expect any set that we call the 
real numbers ought to possess. 


Algebraic Properties 
We should be able to add, multiply, divide, etc., real numbers. In short, we 
require the set of real numbers to be a field. 


Positivity Properties 

The second aspect of any set we think of as the real numbers is that it has 
some notion of direction, some notion of positivity. It is this aspect that will 
allow us to “compare” numbers, e.g., one number is larger than another. 
The mathematically precise way to discuss this notion is the following. 


A field F is called an ordered field if there exists a subset P C F' that 
satisfies the following two properties: 


1.Ifz,y € P, then z + yand zy are in P. 
2. If x € F’, then one and only one of the following three statements 
is true. 


iwc P, 
lic: =2 € .P,,and 
iii. c = 0. (This property is known as the law of tricotomy.) 


The elements of the set P are called positive elements of F’, and the 
elements x for which —z belong to P are called negative elements of F’. 


As a consequence of these properties of P, we may introduce in F’ a notion 
of order. 


If Fis an ordered field, and x and y are elements of F’, we say that 
x<yify—z« c€ P. Wesay that z < yifeitherz < yorx = y. 


We say thatz > yify<az,andz > yify<z. 


An ordered field satisfies the familiar laws of inequalities. They are 
consequences of the two properties of the set P. 
Exercise: 


Problem: 


Using the positivity properties above for an ordered field F’, together 
with the axioms for a field, derive the familiar laws of inequalities: 


a. (Transitivity) If x < yandy < z, thenz < z. 


b. 


Cc. 
d. 
e. 
i 
g. 


(Adding like inequalities) If  < yand z < w, then 

BE 2< YW. 

Ifx < yanda > 0, then az < ay. 

Ifx < yanda < 0, then ay < az. 

If0 <a< band0O <c < d, thenac < bd. 

Verify parts (a) through (e) with < replaced by <. 

If x and y are elements of F’, show that one and only one of the 
following three relations can hold: (i) x < y, (ii) x > y, (iii) 


c= yy, 


. Suppose x and y are elements of F’, and assume that « < y and 


y < az. Prove that x = y. 


Exercise: 


Problem: 


a. 


If F' is an ordered field, show that 1 € P;i.e., that 0 < 1. HINT: 
By the law of tricotomy, only one of the three possibilities holds 
for 1. Rule out the last two. 


. Show that F7 of [link] is not an ordered field; i.e., there is no 


subset P C F7 such that the two positivity properties can hold. 
HINT: Use part (a) and positivity property (1). 


. Prove that Q is an ordered field, where the set P is taken to be the 


usual set of positive rational numbers. That is, P consists of those 
rational numbers a/b for which both a and 6 are natural numbers. 


. Suppose Fis an ordered field and that x is a nonzero element of 


F. Show that for all natural numbers nnzx + 0. 


. (e) Show that, in an ordered field, every nonzero square is 


positive; i.e., if 4 0, then x? € P. 


We remarked earlier that there are many different examples of fields, and 
many of these are also ordered fields. Some fields, though technically 
different from each other, are really indistinguishable from the algebraic 
point of view, and we make this mathematically precise with the following 
definition. 


Let F) and F> be two ordered fields, and write P; and P> for the set of 
positive elements in F) and F5 respectively. A 1-1 correspondence J 
between F and F% is called an isomorphism if 


1. J(x+y) = J(x) + J(y) forall x,y € Fi. 
2. J(xy) = J(x)J(y) for all 2, y € Fy. 
3.x € P, if and only if J (x) € Py. 


REMARK. In general, if A; and Ag are two algebraic systems, then a 1-1 
correspondence between A, and Ag is called an isomorphism if it converts 
the algebraic structure on A, into the corresponding algebraic structure on 
Ap». 

Exercise: 


Problem: 


a. Let F’ be an ordered field. Define a function J : N —> F' by 
J(n) = n-1. Prove that J is an isomorphism of N onto a subset 


~ 


N of F’. That is, show that this correspondence is one-to-one and 
converts addition and multiplication in NV into addition and 
multiplication in F’. Give an example to show that this result is 
not true if F’ is merely a field and not an ordered field. 

b. Let F' be an ordered field. Define a function J : Q — F' by 
J (k/n) =k-1 x (n-1)~". Prove that J is an isomorphism of 
the ordered field @ onto a subset Q of the ordered field F’. 
Conclude that every ordered field F’ contains a subset that is 
isomorphic to the ordered field Q. 


REMARK. Part (b) of [link] shows that the ordered field Q is the smallest 
possible ordered field, in the sense that every other ordered field contains an 
isomorphic copy of Q. However, as mentioned earlier, the ordered field Q 
cannot suffice as the set of real numbers. There is no rational number whose 
Square is 2, and we want the square root of 2 to be a real number. See [link] 
below. What extra property is there about an ordered field F’ that will allow 


us to prove that numbers like J Qn, and so on are elements of F’? It turns 
out that the extra property we need is related to a quite subtle point 


concerning upper and lower bounds of sets. It gives us some initial 
indication that the known-to-be subtle concept of a limit may be 
fundamental to our very notion of what the real numbers are. 


If S is a subset of an ordered field F’, then an element x € F is called 
an upper bound for S if x > y for every y € S. An element z is called 
a lower bound for S if z < y for every y € S. 


A subset S' of an ordered field F' is called bounded above if it has an 
upper bound; it is called bounded below if it has a lower bound; and it 
is called bounded if it has both an upper bound and a lower bound. 


An element M is called the least upper bound or supremum of a set S 
if it is an upper bound for S and if M < z for every other upper bound 
x of S. That is, M is less than or equal to any other upper bound of S. 


Similarly, an element m is called the greatest lower bound or infimum 
of Sif it is a lower bound for S and if z < m for every other lower 
bound z of S. That is, m is greater than or equal to any other lower 
bound of S. 


Clearly, the supremum and infimum of a set S are unique. For instance, if 
M and M' are both least upper bounds of a set S,, then they are both upper 
bounds of .S. We would then have M < M' and M' < M. Therefore, by 
part (h) of [link], M = M’. 


It is important to keep in mind that an upper bound of a set S need not be an 
element of S, and in particular, the least upper bound of S may or may not 
actually belong to S. 


If M is the supremum of a set S, we denote M by sup S. If m is the 
infimum of a set S, we denote it by inf S. 
Exercise: 


Problem: 
a. Suppose S$ is a nonempty subset of an ordered field F' and that x 


is an element of F’. What does it mean to say that “x is not an 
upper bound for S$?” 


b. 


Cc. 


Let F be an ordered field, and let S be the empty set, thought of 
as a subset of F’. Prove that every element x € F' is an upper 
bound for S and that every element y € F is a lower bound for S. 
HINT: If not, then what? 

If S = @, show that S' has no least upper bound and no greatest 
lower bound. 


REMARK. The preceding exercise shows that peculiar things about upper 
and lower bounds happen when S is the empty set. One point is that just 
because a set has an upper bound does not mean it has to have a least upper 


bound. 


That is, no matter which upper bound we choose, there is always 


another one that is strictly smaller. This is a very subtle point, and it is in 
fact quite difficult to give a simple concrete example of this phenomenon. 
See the remark following [link]. However, part (d) of [link] contains the 
seed of an example. 

Exercise: 


Problem: 


A natural number a is called even if there exists a natural number c 
such that a = 2c, and a is called odd if there exists a natural number c 
such that a = 2c +1. 


a. 


b. 


Prove by induction that every natural number is either odd or 
even. 

Prove that a natural number a is even if and only if a? = a x ais 
even. 


. Prove that there is no element x of Q whose square is 2. That is, 


the square root of 2 is not a rational number. HINT: Argue by 
contradiction. Suppose there is a rational number &/n for which 
k?/n* = 2, and assume, as we may, that the natural numbers k 
and n have no common factor. Observe that k must be even, and 
then observe that n also must be even. 


. Let S be the set of all positive rational numbers x for which 


a? = ¢ X & < 2. Prove that S has an upper bound and a lower 


bound. Can you determine whether or not S' has a least upper 
bound? 


The existence of least upper bounds and greatest lower bounds of 
bounded sets turns out to be the critical idea in defining the real 
numbers. It is precisely the existence of such suprema and infimas that 
enables us to define as real numbers quantities such as V2 ,7,e€, and so 
on. 


An ordered field F is called complete if every nonempty subset S of F 
that has an upper bound has a least upper bound. 


REMARK. Although Q is an ordered field, we will see that it is not a 
complete ordered field. In fact, the answer to part (d) of [link] is no. The set 
described there, though bounded above, has no least upper bound. In fact, it 
was one of nineteenth century mathematicians’ major achievements to prove 
the following theorem. 


There exists a complete ordered field. 
We leave the proof of this theorem to the appendix. 


Perhaps the most reassuring result along these lines is the following 
companion theorem, whose proof we also leave to the appendix. 


If F, and Fy, are two complete ordered fields, then they are isomorphic. 


Taken together, the content of the two preceding theorems is that, up to 
isomorphism, there exists one and only one complete ordered field. For no 
other reason that that, this special field should be an important object in 
mathematics. Our definition of the real numbers is then the following: 


By the set R of real numbers we mean the (unique) complete ordered 
field. 


Properties of the Real Numbers 

Properties of the real numbers, discussing isomorphic subsets, nonempty 
subsets with a greatest lower bound, least upper bound properties, positive 
square roots, and other aspects of real numbers. 


The set R contains a subset that is isomorphic to the ordered field Q of 
rational numbers, and hence subsets that are isomorphic to N and Z. 


REMARK. The proof of [link] is immediate from part (b) of Exercise 1.7. 
In view of this theorem, we will simply think of the natural numbers, the 
integers, and the rational numbers as subsets of the real numbers. 


Having made a definition of the set of real numbers, it is incumbent upon us 
now to verify that this set R satisfies our intuitive notions about the reals. 
Indeed, we will show that \/2 is an element of R and hence is a real 
number (as plane geometry indicates it should be), and we will show in 
later chapters that there are elements of # that agree with our intuition 
about e and 7. Before we can proceed to these tasks, we must establish 
some special properties of the field R. The first, the next theorem, is simply 
an analog for lower bounds of the least upper bound condition that comes 
from the completeness property. 


If S is a nonempty subset of R that is bounded below, then there exists a 
greatest lower bound for S. 


Define T to be the set of all real numbers x for which —z € S. That is, T is 
the set —S. We claim first that T’ is bounded above. Thus, let m be a lower 
bound for the set S, and let us show that the number —m is an upper bound 
for T. If x € T, then —x € S. So, m < —z, implying that —m > z. Since 
this is true for all x € 7’, the number —m™ is an upper bound for 7’. 


Now, by the completeness assumption, T' has a least upper bound Mp. We 
claim that the number —M, is the greatest lower bound for S. To prove 
this, we must check two things. First, we must show that —Mp is a lower 
bound for S. Thus, let y be an element of S. Then —y € T, and therefore 
—y < Mp. Hence, —Mpo < y, showing that —Mp is a lower bound for S. 


Finally, we must show that — Mp is the greatest lower bound for S. Thus, 
let m be a lower bound for S. We saw above that this implies that —m is an 
upper bound for 7’. Hence, because Mp is the least upper bound for T’, we 
have that —m > Mp, implying that m < — Mp, and this proves that —Mo 
is the infimum of the set S. 


The following is the most basic and frequently used property of least upper 
bounds. It is our first glimpse of “ limits.” Though the argument is 
remarkably short and sweet, it will provide the mechanism for many of our 
later proofs, so master this one. 


Let S be a nonempty subset of R that is bounded above, and Let Mo denote 
the least upper bound of S; i.e., Mo =sup S. Then, for any positive real 
number «€ there exists an element t of S such that t > Mo — e. 


Let € > 0 be given. Since Mp — € < Mg, it must be that Mo — « is not an 
upper bound for S’.. (Mp is necessarily less than or equal to any other upper 
bound of S.) Therefore, there exists an element t € S for which 

t > Mp — «. This is exactly what the theorem asserts. 

Exercise: 


Problem: 


a. Let S be a nonempty subset of R which is bounded below, and let 
Mo denote the infimum of S. Prove that, for every positive 6, 
there exists an element s of S such that s < mp + 6. Mimic the 
proof to [link]. 

b. Let S be any bounded subset of R, and write —S for the set of 
negatives of the elements of S. Prove that sup (—S) = — inf S. 

c. Use part (b) to give an alternate proof of part (a) by using [link] 
and a minus sign. 


Exercise: 
Problem: 


a. Let S'be the set of all real numbers x for which x < 1. Give an 
example of an upper bound for S’.. What is the least upper bound 


of S? Is sup S an element of S? 

b. Let S' be the set of all 2 € R for which x? < 4. Give an example 
of an upper bound for S.. What is the least upper bound of S”? 
Does sup S belong to S? 


We show now that A contains elements other than the rational numbers in 
Q. Of course this holds for any complete ordered field. The next theorem 
makes this quite explicit. 


If x is a positive real number, then there exists a positive real number y 
such that y* = «. That is, every positive real number « has a positive 
square root in R. Moreover, there is only one positive square root of x. 


Let S be the set of positive real numbers t for which t? < «. Then S is 
nonempty Indeed, If « > 1, then 1 is in S because 
?=1x1<1x2=72.And,ifz < 1, then z itself is in S, because 
e=axnr<1xr=z. 


Also, S is bounded above. In fact, the number 1 + 2/2 is an upper bound 
of S. Indeed, arguing by contradiction, suppose there were a ¢ in S such 
thatt > 1+ 2/2. Then 


Equation: 


a>t>(1+2/2"?=1+2427/4>2, 


which is a contradiction. Therefore, 1 + 2/2 is an upper bound of S, and so 
S is bounded above. 


Now let y =sup S. We wish to show that y? = x. We show first that 
y” < a, and then we will show that y? > a. It will then follow from the 
tricotomy law that y” = x. We prove both these inequalities by 
contradiction. 


So, assume first that y? > x, and write a for the positive number x” — z. 
Let € be the positive number a/(2y), and, using Theorem 1.5, choose a 


t € S such thatt > y—e. Theny+t < (2y), andy—t < €=a/2y. So, 


a = y—2« 
PoP EP 
< y-? 
= (yt+t)(y—-t) 
< 2y(y—t) 
< 2ye 
(e4 
=A a ae 
— Qa, 


which is a contradiction. Therefore y is not greater than z. 


Now we show that y? is not less than x. Again, arguing by contradiction, 
suppose it is, and let € be the positive number x — y”. Choose a positive 
number 6 that is less than y and also less than €/(3y). Let s = y+ 6. Then 
s is not in S, whence s? > z, so that we must have 


e= 2-y’ 


ee ey ee ey, 
sta? 
(s+y)(s—y) 
(2y + 6)d 

3y6 

1G; 


IA 


/\ 


which again is a contradiction. 
This completes the proof that y* = 2; i.e., that x has a positive square root. 


Finally, if y/ were another positive number for which 1/ — x, we show that 
y = y by ruling out the other two cases: y < y/ and y > /. For instance, if 


y < y,, then we would have that y” < y”, giving that 
j=9 Sy =e, 
implying that x < 2, and this is a contradiction. 


If x is a positive real number, then the symbol 4/z will denote the 


unique positive number y for which y? = z. Of course, 0 denotes 
the number 0. 


REMARK Part (c) of [link] shows that the field Q contains no number 
whose square is 2, and [link] shows that the field R does contain a number 
whose square is 2. We have therefore “proved” that the real numbers is a 
larger set than the rational numbers. It may come as a surprise to learn that 
we only now have been able to prove that. Look back through the chapter to 
be sure. It follows also that Q itself is not a complete ordered field. If it 
were, it would be isomorphic to R, by Theorem 1.2, so that it would have to 
contain a square root of 2, which it does not. 


A real number z that is not a rational number, i.e., is not an element of 
the subset Q of R, is called an irrational number. 


Exercise: 


Problem: 


a. Prove that every positive real number has exactly 2 square roots, 
one positive (./z) and the other negative (—4/z). 

b. Prove that if 2 is a negative real number, then there is no real 
number y such that y? = z. 

c. Prove that the product of a nonzero rational number and an 
arbitrary irrational number must be irrational. Show by example 
that the sum and product of irrational numbers can be rational. 


Intervals and Approximation 
The absolute value is introduced, and open intervals and natural numbers 
are defined. Denseness of fields is discussed. 


We introduce next into the set of real numbers some geometric concepts, 
namely, a notion of distance between numbers. Of course this had to 
happen, for geometry is the very basis of mathematics. 


The absolute value of a real number z is denoted by |z| and is defined 


as follows: 
1. (i) |0| = 0. 
2. (ii) If x > O then |x| = z. 
3. (iii) If « < 0 (—ax > 0) then |a| = —z. 


We define the distanced(x, y) between two real numbers x and y by 
d(x,y) = |x — y|. 


Obviously, such definitions of absolute value and distance can be made in 
any ordered field. 
Exercise: 


Problem: Let x and y be real numbers. 


a. Show that |x| > 0, and that x < |z]. 
b. Prove the Triangle Inequality for absolute values. 
Equation: 


jz +y| < |x| + [yl 
HINT: Check the three cases x + y > 0,2 + y < 0, and 
e-py—0. 
c. Prove the so-called backward” triangle inequality. 
Equation: 


aay (el yl: 


HINT: Write |x| = |(a — y) + y|, and use part (b). 


d. Prove that |ry| = |az||y|. 

e. Prove that |x| = Vx? for all real numbers z. 

f. Prove the Triangle Inequality for the distance function. That is, 
show that 
Equation: 


d(x, y) < d(a,z) + d(z,y) 


forall @y,;2-€ ft. 


Exercise: 


Problem: 


a. Prove that x = y if |x — y| < e€ for every positive number e. 
HINT: Argue by contradiction. Suppose x # y, and take 
e= |x — y|/2. 

b. Prove that x = yif and only if — y< e€ andy — za < € for 
every positive e. 


Let a and 6 be real numbers for which a < b. By the open interval 
(a,b) we mean the set of all real numbers x for which a < x < 6b, and 
by the closed interval [a,b] we mean the set of all real numbers x for 
whicha < x < b. 


By (a, 00) we mean the set of all real numbers x for which a < x, and by 
[a, co) we mean the set of all real numbers x for whicha < z. 


Analogously, we define (—oo, b) and (—oo, bj to be respectively the set of 
all real numbers x for which x < b and the set of all real numbers z for 
which x < b. 

Exercise: 


Problem: 


a. Show that the intersection of two open intervals either is the 
empty set or it is again an open interval. 


b. Show that (a, b) = (—oo, b) N (a, oo). 

c. Let y be a fixed real number, and let € be a positive number. Show 
that the inequality | — y| < € is equivalent to the pair of 
inequalities 
Equation: 


y—e<xandz<yt+e; 


i.e., Show that z satisfies the first inequality if and only if it 
satisfies the two latter ones. Deduce that |2 — y| < e if and only 
if x is in the open interval (y — €, y + €). 


Here is one of those assertions that seems like an obvious fact. However, it 
requires a proof which we only now can give, for it depends on the 
completeness axiom, and in fact is false in some ordered fields. 


Let N denote the set of natural Numbers, thought of as a subset of R. Then 
N is not bounded above. 


Suppose false. Let M be an upper bound for the nonempty set NV, and let 
Mo be the least upper bound for NV. Taking e€ to be the positive number 1/2, 
and applying Theorem 1.5, we have that there exists an element k of N 
such that My — 1/2 < k. But then Mp —1/2+1<k+1, or, 

Mop +1/2<k+1.So0 My <k+1. But Mp > k+ 1 because Mp is an 
upper bound for VV. We have thus arrived at a contradiction, and the 
theorem is proved. 


REMARK As mentioned above, there do exist ordered fields F’ in which 
the subset Nis bounded above. Such fields give rise to what is called 
“nonstandard analysis,” and they were first introduced by Abraham 
Robinson in 1966. The fact that R is a complete ordered field is apparently 
crucial to be able to conclude the intuitively clear fact that the natural 
numbers have no upper bound. 


[link] presents another intuitively obvious fact, and this one is in some real 
sense the basis for many of our upcoming arguments about limits. It relies 


on the preceding theorem, is in fact just a corollary, so it has to be 
considered as a rather deep property of the real numbers; it is not something 
that works in every ordered field. 

Exercise: 


Problem: 


Prove that if € is a positive real number, then there exists a natural 
number NV such that 1/N < e. 


[link] and [link] show that the set Q of rational numbers is “everywhere 
dense” in the field R. That is, every real number can be approximated 
arbitrarily closely by rational numbers. Again, we point out that this result 
holds in any complete ordered field, and it is the completeness that is 
critical. 


Let a < b be two real numbers. Then there exists a rational number 
r = p/q in the open interval (a, b). In fact, there exist infinitely many 
rational numbers in the interval (a, b). 


If a < 0 and b > 0, then taking r = 0 satisfies the first statement of the 
theorem. Assume first that a > 0 and b > a. Let n be a natural number for 
which 1/n is less than the positive number b — a. (Here, we are using the 
completeness of the field, because we are referring to Theorem 1.7, where 
completeness was vital.) If a = 0, then b = b — a. Setting r = 1/n, we 
would have that a < r < b. So, again, the first part of the theorem would 
be proved in that case. 


Suppose then that a > 0, and choose the natural number gq to be such that 
1/q is less than the minimum of the two positive numbers a and b — a. 
Now, because the number aq is not an upper bound for the set NV, we may 
let p be the smallest natural number that is larger than ag. Set r = p/q. 


We have first that ag < p, implying that a < p/q = r. Also, because p is 
the smallest natural number larger than aq, we must have that p — 1 < aq. 
Therefore, (p — 1)/q < a, or (p/q) — (1/q) < a, implying that 


r=p/q<a+1/q<a+(b-—a) = b. Hence, a < randr < b, and the 
first statement of the theorem is proved when both a and b are nonnegative. 


If both a and 6 are nonpositive, then both —b and —a are nonnegative, and, 
using the first part of the proof, we can find a rational number r such that 
—b<r< —a.So,a < —r < ), and the first part of the theorem is proved 
in this case as well. 


Clearly, we may replace 6 by r and repeat the argument to obtain another 
rational r; such that a < ry < r < b. Then, replacing 6 by r; and repeating 
the argument, we get a third rational rz such thata < rg < ry <r <b. 
Continuing this procedure would lead to an infinite number of rationals, all 
between a and b. This proves the second statement of the theorem. 
Exercise: 


Problem: 


a. Let € > 0 be given, and let & be a nonnegative integer. Prove that 
there exists a rational number p/q such that 
Equation: 


ke < p/q < (k+1)e. 


b. Let x be a positive real number and let € be a positive real 
number. Prove that there exists a rational number p/q such that 
x —€ < p/q < &. State and prove an analogous result for 
negative numbers 2x. 


Exercise: 


Problem: 


a. If a and b are real numbers with a < 6, show that there is an 
irrational number z (not a rational number) between a and 8, i.e., 
with a < x < 6. HINT: Apply [link] to the numbers av/2 and 
bv/2. 

b. Conclude that within every open interval (a, b) there is a rational 
number and an irrational number. Are there necessarily infinitely 


many rationals and irrationals in (a, b)? 


The preceding exercise shows the “denseness” of the rationals and the 
irrationals in the reals. It is essentially clear from this that every real 
number is arbitrarily close to a rational number and an irrational one. 


The Geometric Progression and the Binomial Theorem 

The binomial theorem is introduced, the existences of nth roots of real numbers is 
explored, the binomial coefficient is defined, and a theorem providing a formula for 
the sum of a geometric progression is included. 


There are two special algebraic identities that hold in R (in fact in any field F 
whatsoever) that we emphasize. They are both proved by mathematical induction. 
The first is the formula for the sum of a geometric progression. 

Geometric Progression 


Let x be a real number, and let n be a natural number. Then, 


1.If z #1, then 


Equation: 
n +1 
Ges 1-2” 
<0 ea 2 
2. If « = 1, then 
Equation: 


en 
j=0 


The second claim is clear, since there are n + 1 summands and each is equal to 1. 


We prove the first claim by induction. Thus, if m = 1, then the assertion is true, 
since 
Equation: 


1 
Soa =a°+e' =14+2=(1+2) z 
j=0 


Now, supposing that the assertion is true for the natural number k, i.e., that 
Equation: 


+1 


Pe a, hee 
ues l—-a ’ 


let us show that the assertion holds for the natural number k + 1. Thus 
Equation: 


k+1 k 
Sa! =) So ai tah 
j=0 j=0 
k+1 
ae. ee 4 ght 
1-2 


_— k+l fi. k+l _ op kt2 


which completes the proof. 


The second algebraic formula we wish to emphasize is the Binomial Theorem. 
Before stating it, we must introduce some useful notation. 


Let n be a natural number. As earlier in this chapter, we define n! as follows: 
Equation: 


ni=nx (n—1)x (n—-2)x...« 2&1. 


For later notational convenience, we also define 0! to be 1. 


If k is any integer for which 0 < k < n, we define the binomial coefficient 


({,) by 
Equation: 
(") = n| _ nx (n—1)x (n—-2)x...x (n-k+1) 
k/ — kM(n—k)! k! 
Exercise: 
Problem: 


a, Prove that (¢) =1, (7) = and (") =1. 


b. Prove that 
Equation: 


(")< 


for all natural numbers n and all integers 0 < k < n. 


c. Prove that 
n+1 n n 
( k )- +Ge 1) 


Equation: 
for all natural numbers n and all integers 1 << k <n. 


If z,y € Rand nis a natural number, then 
Equation: 


(n+y)"= (Cet 


We shall prove this theorem by induction. If nm = 1, then the assertion is true, for 
(c+ y)' =a+yand 


Equation: 
4/1 1 1 
So(D att = (Hates (Paty a4 
i 0 1 


Now, assume that the assertion holds for the natural number 7; i.e., 
Equation: 


(e+y) = > P) ci eee 


k=0 


and let us prove that the assertion holds for the natural number 7 + 1. We will make 
use of part (c) of [link]. We have that 


Equation: 


(x+y) 


1 
wu j k, j+1—k 
j j+1- 
x +O) y 
MF Vege PR 
+ y0 k xy ry 
k=l 
j ; 
4 (2 i) 4 (3) Jet 
k=1 . 
j : 
gee fink 4 itl 
k=l 
+ ea 7 LIN: oo 
Gia) j+1 (77 ) kayitl + (75 je j+l 
k=l 
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which shows that the assertion of the theorem holds for the natural number 7 + 1. 
This completes the proof. 


The next exercise is valid in any ordered field, but, since we are mainly interested 
in the order field R, we state everything in terms of that field. 
Exercise: 


Problem: 


a. If x and y are positive real numbers, and if n and k are natural numbers 
with k < n, show that (x + y)” > (peg 

b. For any positive real number z and natural number n, show that 
(l+2)" >1+4+n2. 

c. For any real number z > —1 and natural number n, prove that 
(1+2)" >1+ nz. HINT: Do not try to use the binomial theorem as in 
part (b); it won't work because the terms are not all positive; prove this 
directly by induction. 


There is one more important algebraic identity, which again can be proved by 
induction. It is actually just a corollary of the geometric progression formula. 


If z,y € Rand nis a natural number, then 
Equation: 


n—1 
x —y" = (2—y)(>_ aly? 4, 
j=0 


If m = 1 the theorem is clear. Suppose it holds for a natural number k, and let us 
prove the identity for the natural number k + 1. We have 
Equation: 


= (x-y)a*+y(a*—y*) 
k-1 
= (eae yey) ( a) 
j=0 
k-1 
Sea i)e haa eur? 
j=0 
k-1 
= (x-y) (ov an So aly’ ) 
j=0 
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, which shows that the assertion holds for the natural number k + 1. So, by 
induction, the theorem is proved. 
Exercise: 


Problem: Let z and y be real numbers. 


a. Let n be an odd natural number; i.e.,n = 2k + 1 for some natural 
number k. Show that 
Equation: 


HINT: Write x” + y” = 2” — (—y)". 
b. Show that x? + y” can not be factored into a product of the form 
(ax + by)(cx + dy) for any choices of real numbers a, b, c, and d. 


Using the Binomial Theorem together with the preceding theorem, we may now 
investigate the existence of nth roots of real numbers. This next theorem is 
definitely not valid in any ordered field, for it again depends on the completeness 


property. 


Let n be a natural number and let x be a positive real number. Then there exists a 
unique positive real number y such that y” = 2; i.e., x has a unique positive nth 
root. 


Note first that if 0 < t < s, then t” < s”. (To see this, argue by induction, and use 
part (e) of [link].) Using this, we mimic the proof of [link]. Thus, let S' be the set of 
all positive real numbers t for which t” < x. Then S is nonempty and bounded 
above. Indeed, if x > 1, then 1 € S, while if x < 1, then z itself is in S. 
Therefore, S is nonempty. Also, using part (b) of [link], we see that 1 + (a/n) is an 
upper bound for S. For, ift > 1+ a/n, then 

Equation: 


t” > (1+ (a/n))" >1+n(a2/n) >a. 


Now let y =sup S, and let us show that y” = x. We rule out the other two 
possibilities. First, if y” > x, let € be the positive number y” — x, and define e’ to 
be the positive number €/(ny”~*). Then, using [link], choose t € S' so that 

y —e' <t < y. ({link] is where the completeness of the ordered field R is crucial.) 
We have 

Equation: 


I/\ 
Kay 
3 
| 
Cs 
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(y—t) (Ss yo) 
j=0 


n—-1 
< o-0)( vi) 
j=0 
n—-1 
= v-0( yo 
j=0 
Zz e'ny” |! 


and this is a contradiction. Therefore, y” is not greater than x. 


Now, if y” < a, let € be the positive number x — y”, and choose a 6 > O such that 
6 < landd < €/(y+ 1)”. Then, using the Binomial Theorem, we have that 
Equation: 


(yoy = (ye 


| 
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3 
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implying that y+ 6 € S. But this is a contradiction, since y =sup S. Therefore, y” 
is not less than x, and so y” = a. 


We have shown the existence of a positive nth root of z. To see the uniqueness, 
suppose y and y/’ are two positive nth roots of x. Then 
Equation: 


4) = y” _ yl 


n—-1 ; 
= (y-y) (S eye 
7-0 


m-J-! — Q, Since this latter sum 


which implies that either y — y’ = 0 or De yy 
consists of positive terms, it cannot be 0, whence y = y’. This shows that there is 
but one positive nth root of x, and the theorem is proved. 


Exercise: 


Problem: 


a. Show that ifn = 2k is an even natural number, then every positive real 
number has exactly two distinct nth roots. 

b. If m = 2k + 1 is an odd natural number, show that every real number has 
exactly one nth root. 

c. If n is a natural number greater than 1, prove that there is no rational 
number whose nth power equals 2, i.e., the nth root of 2 is not a rational 
number. 


The Complex Numbers 

Complex numbers are covered, involving i. The fundamental theorem of algebra is 
referenced. The absolute value of a complex number is defined. The triangle 
inequality is stated. 


It is useful to build from the real numbers another number system called the complex 
numbers. Although the real numbers R have many of the properties we expect, i.e., 
every positive number has a positive square root, every number has a cube root, and 
so on, there are somewhat less prominent properties that R fails to possess. For 
instance, negative numbers do not have square roots. This is actually a property that 
is missing in any ordered field, since every square is positive in an ordered field. See 
part (e) of [link]. One way of describing this shortcoming on the part of the real 
numbers is to note that the equation 1 + 2? = 0 has no solution in the real numbers. 
Any solution would have to be a number whose square is —1, and no real number 
has that property. As an initial extension of the set of real numbers, why not build a 
number system in which this equation has a solution? 


We faced a similar kind of problem earlier on. In the set VV there is no element 7 such 
that 7 +n = n for all n € N. That is, there was no element like 0 in the natural 
numbers. The solution to the problem in that case was simply to “create” something 
called zero, and just adjoin it to our set N.. The same kind of solution exists for us 
now. Let us invent an additional number, this time denoted by 7, which has the 
property that its square 77 is —1. Because the square of any nonzero real number is 
positive, this new number 2 was traditionally referred to as an “imaginary” number. 
We simply adjoin this number to the set R, and we will then have a number whose 
square is negative, i.e., —1. Of course, we will require that our new number system 
should still be a field; we don't want to give up our basic algebraic operations. There 
are several implications of this requirement: First of all, if y is any real number, then 
we must also adjoin to R the number y x 2 = yt, for our new number system should 
be closed under multiplication. Of course the square of iy will equal i7y? = —y?, 
and therefore this new number zy must also be imaginary, i.e., not a real number. 
Secondly, if 2 and y are any two real numbers, we must have in our new system a 
number called x + yz, because our new system should be closed under addition. 


Let 2 denote an object whose square 12 = —1. Let C be the set of all objects 
that can be represented in the form z = x + yt, where both z and y are real 
numbers. 


Define two operations + and x on Cas follows: 
Equation: 


(e+ yi) + (a + yt) =a+a'+ yty')i, 


and 
Equation: 


(a + iy) (a + iy’) = va! + wiy’ + iya’ + iyiy’ = aa! — yy’ + (vy’ + yz’')i. 


1. The two operations + and x defined above are commutative and associative, 
and multiplication is distributive over addition. 

2. Each operation has an identity: (0 + 07) is the identity for addition, and 
(1 + 02) is the identity for multiplication. 

3. The set C’ with these operations is a field. 


We leave the proofs of Parts (1) and (2) to the following exercise. To see that Cis a 
field, we need to verify one final condition, and that is to show that if 
z=x2+yi40=0-+ 01, then there exists aw = u-+ vi such that 

zx w=1=1+4 01. Thus, suppose z = x + yi # 0. Then at least one of the two 
real numbers z and y must be nonzero, so that x? + y? > 0. Define a complex 
number w by 

Equation: 


x aa! ae 
24 as 24 hs 
Lr y L y 


We then have 


Equation: 
F x —yY 
zZxXWwW = x+y) xX ————— + ———~1 
( y ) (= aM y? 2 ui y? 
x? -y? -y 

= 46 y 
xz? + y? 0 

= rm Ia 
2 + y? 2 +y? 

= 14+01 


as desired. 
Exercise: 


Problem: Prove parts (1) and (2) of [link]. 


One might think that these kinds of improvements of the real numbers will go on and 
on. For instance, we might next have to create and adjoin another object 7 so that the 
number 7 has a square root; i.e., so that the equation 2 — z? = Ohas a solution. 
Fortunately and surprisingly, this is not necessary, as we will see when we finally 
come to the Fundamental Theorem of Algebra in [link]. 


The subset of C' consisting of the pairs x + 02 is a perfect (isomorphic) copy of the 
real number system FR. We are justified then in saying that the complex number 
system extends the real number system, and we will say that a real number z is the 
same as the complex number x + 02. That is, real numbers are special kinds of 
complex numbers. The complex numbers of the form 0 + yi are called purely 
imaginary numbers. Obviously, the only complex number that is both real and purely 
imaginary is the number 0 = 0 + 02. The set C can also be regarded as a 2- 
dimensional space, a plane, and it is also helpful to realize that the complex numbers 
form a 2-dimensional vector space over the field of real numbers. 


If z= x + yt, we say that the real number z is the real part of z and write 
x = A(z). We say that the real number y is the imaginary part of z and write 


y= 32). 


If z = x + yz is a complex number, define the complex conjugatez of z by 

2= oF =U. 
The complex number 3 satisfies i? = —1, showing that the negative number —1 has 
a square root in C, or equivalently that the equation 1 + z? = 0 has a solution in C. 
We have thus satisfied our initial goal of extending the real numbers. But what about 
other complex numbers? Do they have square roots, cube roots, nth roots? What 
about solutions to other kinds of equations than 1 + z?? 
Exercise: 


Problem: 


a. Prove that every complex number has a square root. HINT: Let 
z—a+bi. Assume w = x + yi satisfies w? = z, and just solve the two 
equations in two unknowns that arise. 

b. Prove that every quadratic equation az” + bz +c = 0, fora, b, and c 
complex numbers, has a solution in C’. HINT: If a = 0, it is easy to find a 
solution. If a 4 0, we need only find a solution to the equivalent equation 
Equation: 


b 
24-24 5=0, 
a a 


Justify the following algebraic manipulations, and then solve the equation. 
Equation: 
b? bf c 
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What about this new field C’? Does every complex number have a cube root, a fourth 
root, does every equation have a solution in C? A natural instinct would be to 
suspect that C’ takes care of square roots, but that it probably does not necessarily 
have higher order roots. However, the content of the Fundamental Theorem of 
Algebra, to be proved in [link], is that every equation of the form P(2) = 0, where 
P is anonconstant polynomial, has a solution in C’. This immediately implies that 
every complex number c has an nth root, for any solution of the equation z” — c = 0 
would be an nth root of c. 


The fact that the Fundamental Theorem of Algebra is true is a good indication that 
the field C is a “good” field. But it's not perfect. 


In no way can the field C’ be made into an ordered field. That is, there exists no 
subset P of C that satisfies the two positivity axioms. 


Suppose C’' were an ordered field, and write P for its set of positive elements. Then, 
since every square in an ordered field must be in P (part (e) of [link]), we must have 
that —1 = 7? must be in P. But, by part (a) of [link], we also must have that 1 is in 
P, and this leads to a contradiction of the law of tricotomy. We can't have both 1 and 
—1 in P. Therefore, C is not an ordered field. 


Although we may not define when one complex number is smaller than another, we 
can define the absolute value of a complex number and the distance between two of 
them. 


If z = x + yt is in C, we define the absolute value of z by 
Equation: 


= verry. 


We define the distanced(z, w) between two complex numbers z and w by 
d(z,w) = |z— wl. 


If c € Candr > 0, we define the open disk of radius r around c, and denote it 
by B, (c), by 
Equation: 


B,(c) = {z€C: |z-e| <r}. 


The closed disk of radius r around c is denoted by B, (c) and is defined by 
Equation: 


B, (ce) ={z€C: |z-—e] <r}. 


We also define open and closed punctured disks B’. (c) and B’, (c) around c by 
Equation: 


Bi. (ce) = {2:0 < |z-e| <r} 


and 
Equation: 


Bi (ec) = {2:0 < |z-c<r}. 


These punctured disks are just like the regular disks, except that they do not 
contain the central point c. 


More generally, if S is any subset of C’, we define the open neighborhood of 
radius r around S, denoted by N, (S), to be the set of all z such that there 
exists aw € S for which |z — w| < r. That is, N; (S) is the set of all complex 
numbers that are within a distance of r of the set S. We define the closed 
neighborhood of radius r around S, and denote it by N, (S), to be the set of all 
z € C for which there exists a w € S such that |z — w| <r. 


Exercise: 


Problem: 


feb) 


. Prove that the absolute value of a complex number 2 is a nonnegative real 
number. Show in addition that |z|? = zz. 

b. Let x be a real number. Show that the absolute value of z is the same 
whether we think of xz as areal number or as a complex number. 

. Prove that max (|SR(z)|, |5(z)|) < |z| < |R(z)| + |5(z)|. Note that this 
just amounts to verifying that 
Equation: 


io) 


max (|x|, |yl) < v/a? + y? < || + |y| 


for any two real numbers z and y. 
d. For any complex numbers z and w, show that z+ w = z+ w, and that 
22. 
Show that z+ z = 2 (z) and z — z = 2i3(z). 
If z=a-+ bi and w = a’ + b’4, prove that |zw| = |z||w|. HINT: Just 
compute |(a + bi) (a! + b/2)|”. 


rh © 


The next theorem is in a true sense the most often used inequality of mathematical 
analysis. We have already proved the triangle inequality for the absolute value of real 
numbers, and the proof was not very difficult in that case. For complex numbers, it is 
not at all simple, and this should be taken as a good indication that it is a deep result. 
Triangle Inequality 


If z and z’ are two complex numbers, then 
Equation: 


Je+2" |< |2|+|2! 
and 
Equation: 

jz—z' |> ||z|—|2" |]. 


We use the results contained in [link]. 
Equation: 


jete’* = (z4-7)\(z+2) 


= (z+2’) (z+2) 


zzt2z2+ 224+ 2/2! 
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The Triangle Inequality follows now by taking square roots. 


REMARK The Triangle Inequality is often used in conjunction with what's called 
the “add and subtract trick.” Frequently we want to estimate the size of a quantity 
like |z — w], and we can often accomplish this estimation by adding and subtracting 
the same thing within the absolute value bars: 

Equation: 


jz—w| = |z-—v+v-—w| <|z-v|+|v—-wI. 


The point is that we have replaced the estimation problem of the possibly unknown 
quantity |z — w| by the estimation problems of two other quantities |z — v| and 

|v — w|. It is often easier to estimate these latter two quantities, usually by an 
ingenious choice of v of course. 

Exercise: 


Problem: 


a. Prove the second assertion of the preceding theorem. 
b. Prove the Triangle Inequality for the distance function. That is, prove that 
Equation: 


d(z,w) < d(z,v) + d(v, w) 


for all z,w,u € C. 
c. Use mathematical induction to prove that 


Equation: 


n n 
d_ als lad 
i=1 i=1 


It may not be necessary to point out that part (b) of the preceding exercise provides a 
justification for the name “triangle inequality.” Indeed, part (b) of that exercise is just 
the assertion that the length of one side of a triangle in the plane is less than or equal 

to the sum of the lengths of the other two sides. Plot the three points z, w, and v, and 
see that this interpretation is correct. 


A subset S of C is called Bounded if there exists a real number / such that 
|z| < M for every zin S. 


Exercise: 


Problem: 


Let S be a subset of C’. Let S; be the subset of R consisting of the real 

parts of the complex numbers in S, and let S2 be the subset of R 

consisting of the imaginary parts of the elements of S. Prove that S is 
a. bounded if and only if S, and Sz are both bounded. 


HINT: Use Part (c) of [link].. 


Let S be the unit circle in the plane, i.e., the set of all complex numbers 
b. z = x + ty for which |z| = 1. Compute the sets $; and S2 of part (a). 


Exercise: 


Problem: 


a. Verify that the formulas for the sum of a geometric progression and the 
binomial theorem ((link] and [link]) are valid for complex numbers z and 
z'. HINT: Check that, as claimed, the proofs of those theorems work in any 
field. 

b. Prove [link] for complex numbers z and z’. 


Definition of the Number e 
A brief introduction of limits with links to important theorems. 


This chapter contains the beginnings of the most important, and probably 
the most subtle, notion in mathematical analysis, i.e., the concept of a limit. 
Though Newton and Leibniz discovered the calculus with its tangent lines 
described as limits of secant lines, and though the Greeks were already 
estimating areas of regions by a kind of limiting process, the precise notion 
of limit that we use today was not formulated until the 19th century by 
Cauchy and Weierstrass. 


The main results of this chapter are the following: 


. The definition of the limit of a sequence, 

. The definition of the real number e ([link]), 

. The Squeeze Theorem ((link]), 

. the Bolzano Weierstrass Theorem (([link] and [link]), 
. The Cauchy Criterion ([link]), 

. the definition of an infinite series, 

. the Comparison Test ([link]), and 

. the Alternating Series Test ({link]). 
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These are powerful basic results about limits that will serve us well in later 
chapters. 


Sequences and Limits 
A definition of a sequence of numbers is followed by the concept of 
convergence and limits. 


A sequence of real or complex numbers is defined to be a function 
from the set N of natural numbers into the setR or C’. Instead of 
referring to such a function as an assignment n — f(n), we ordinarily 
use the notation {a,},{an}7°, or {a1, a2, a3, ...}. Here, of course, an, 
denotes the number f(7n). 


REMARK We expand this definition slightly on occasion to make some of 
our notation more indicative. That is, we sometimes index the terms of a 
sequence beginning with an integer other than 1. For example, we write 


{an} ,{@0, a1, ...}, or even {an}. 


We give next what is the most significant definition in the whole of 
mathematical analysis, i.e., what it means for a sequence to converge or to 
have a limit. 


Let {a,, } be a sequence of real numbers and let L be a real number. 
The sequence {a,,} is said to converge to L, or that L is the limit of 
{an}, if the following condition is satisfied. For every positive number 
€, there exists a natural number NV such that ifn > N, then 

la, — L| <e. 


In symbols, we say L =lim a, or 
Equation: 


L=lim ay. 
n—- oo 


We also may write a, > L. 


If a sequence {a,,} of real or complex numbers converges to a number 
L, we say that the sequence {a,,} is convergent. 


We say that a sequence {a,,} of real numbers diverges to +00 if for 
every positive number /M, there exists a natural number NV such that if 
n > N,thena, > M. Note that we do not say that such a sequence is 
convergent. 


Similarly, we say that a sequence {a,,} of real numbers diverges to 
—oo if for every real number /, there exists a natural number NV such 
that ifn > N,thena, < M. 


The definition of convergence for a sequence {z,, } of complex 
numbers is exactly the same as for a sequence of real numbers. Thus, 
let {z,, } be a sequence of complex numbers and let L be a complex 
number. The sequence {z,, } is said to converge to L, or that L is the 
limit of {zn}, if the following condition is satisfied. For every positive 
number e, there exists a natural number NV such that ifn > N, then 
Zn — L| <e. 


REMARKS The natural number NV of the preceding definition surely 
depends on the positive number e. If €’ is a smaller positive number than e, 
then the corresponding N’ very likely will need to be larger than N. 
Sometimes we will indicate this dependence by writing N(e) instead of 
simply NV. It is always wise to remember that NV depends on e. On the other 
hand, the N or N(e) in this definition is not unique. It should be clear that 
if a natural number JV satisfies this definition, then any larger natural 
number / will also satisfy the definition. So, in fact, if there exists one 
natural number that works, then there exist infinitely many such natural 
numbers. 


It is clear, too, from the definition that whether or not a sequence is 
convergent only depends on the “tail” of the sequence. Specifically, for any 
positive integer A, the numbers aj, a2, ...,@K can take on any value 
whatsoever without affecting the convergence of the entire sequence. We 
are only concerned with a,,'s for nm > N, and as soon as NV is chosen to be 
greater than K, the first part of the sequence is irrelevant. 


The definition of convergence is given as a fairly complicated sentence, and 
there are several other ways of saying the same thing. Here are two: For 


every € > 0, there exists a N such that, whenever n > N,|a, — L| < e. 
And, given an € > 0, there exists a NV such that |a, — L| < ¢ for all n for 
which n > WN. It's a good idea to think about these two sentences and 
convince yourself that they really do “mean” the same thing as the one 
defining convergence. 


It is clear from this definition that we can't check whether a sequence 
converges or not unless we know the limit value L. The whole thrust of this 
definition has to do with estimating the quantity |a,, — L|. We will see later 
that there are ways to tell in advance that a sequence converges without 
knowing the value of the limit. 


Example: 

Let a, = 1/n, and let us show that lim a, = 0. Given an € > 0, let us 
choose a N such that 1/N < e. (How do we know we can find such a N’?) 
Now, if n > N, then we have 

Equation: 


which is exactly what we needed to show to conclude that 0 =lim ay. 


Example: 

Let a, = (2n + 1)/ (1 — 3n), and let L = —2/3. Let us show that 

L =lim a,. Indeed, if € > 0 is given, we must find a NV, such that if 

n > N then |a, + (2/3)| < e. Let us examine the quantity |a,, + 2/3]. 
Maybe we can make some estimates on it, in such a way that it becomes 
clear how to find the natural number JV. 

Equation: 


2n-+1 2 
at (2/3)| = = 
jan + (2/3)| = |e + S| 

z eet ase | 
7 3—9n 
= |r| 
~ 13—9n 
_ 5 
—  9n—3 
_ 5 
— 6n+3n—-—3 
5 
cl peta 
~ 6n 
1 
< poe) 
n 


for allm > 1. Therefore, if N is an integer for which N > 1/e, then 
Equation: 


lan +2/3| <1/n<1/N <e, 


whenever n > JN, as desired. (How do we know that there exists a NV 
which is larger than the number 1/e?) 


Example: 

Let an = 1/,/n, and let us show that lim a, = 0. Given an € > 0, we 
must find an integer N that satisfies the requirements of the definition. It's 
a little trickier this time to choose this NV. Consider the positive number e?. 
We know, from Exercise 1.16, that there exists a natural number NV such 
that 1/N < e?. Now, ifn > N, then 

Equation: 


1 ee ora 
aaa. <= a<ve = €, 


which shows that 0 =lim 1/,/n. 


REMARK A good way to attack a limit problem is to immediately 
examine the quantity |a,, — D|, which is what we did in [link] above. This 
is the quantity we eventually wish to show is less than €e when n > JN, and 
determining which N to use is always the hard part. Ordinarily, some 
algebraic manipulations can be performed on the expression |a,, — L| that 
can help us figure out exactly how to choose NV. Just know that this process 
takes some getting used to, so practice! 

Exercise: 


Problem: 


a. Using the basic definition, prove that lim 3/(2n + 7) = 0. 
b. Using the basic definition, prove that lim 1/ n? = 0. 
c. Using the basic definition, prove that 
lim (n? + 1)/ (n? + 100n) = 1. HINT: Use the idea from the 
remark above; i.e., examine the quantity ja, — L]. 
d. Again, using the basic definition, prove that 
Equation: 
2 . 
lim aia ai 
n-ni 


Remember the definition of the absolute value of a complex 


number. 
e. Using the basic definition, prove that 
Equation: 
_ m+n i. 
lim. ————— =i. 
1— ni 
f. Let a, = (—1)”. Prove that 1 is not the limit of the sequence 


{an}. HINT: Suppose the sequence {a,,} does converge to 1. Use 
€ = 1, let N be the corresponding integer that exists in the 
definition, satisfying |a, — 1] < 1 forall n > N, and then 


examine the quantity |a,, — 1] for various n's to get a 
contradiction. 


Exercise: 


Problem: 


a. Let {a,,} be a sequence of (real or complex) numbers, and let L 
be a number. Prove that L =lim a, if and only if for every 
positive integer k there exists an integer NV, such that ifn > N 
then |a,, — L| < 1/k. 

b. Let {c,,} be a sequence of complex numbers, and suppose that 
Cn +> L. If cn = an + byi and L = a + bi, show that 
a =lim a, and b =lim b,,. Conversely, if a =lim a, and 
b =lim b,, show that a + bi =lim (a, + by). That is, a 
sequence {cy = ay + bpi} of complex numbers converges if and 
only if the sequence {a,,} of the real parts converges and the 
sequence {b,,} of the imaginary parts converges. HINT: You need 
to show that, given some hypotheses, certain quantities are less 
than e. Part (c) of [link] should be of help. 


Exercise: 


Problem: 


a. Prove that a constant sequence (a, = c) converges to c. 
2n?24+1 
1—3n 
c. Prove that the sequence {(—1)"} does not converge to any 
number LZ. HINT: Argue by contradiction. Suppose it does 
converge to a number LD. Use € = 1/2, let N be the 
corresponding integer that exists in the definition, and then 
examine |a, — @n41| forn > N. Use the following useful add 
and subtract trick: 
Equation: 


b. Prove that the sequence { diverges to —oo. 


Ge — An+1 |=|an = da Ant+1 |<|an a L| cn |\L—ay41): 


Existence of Certain Fundamental Limits 

Nondecreasing, nonincreasing, eventually nondecreasing, and eventually 
nonincreasing sequences are defined. Convergence is then established for 
these sequences, and some practice exercises are included. 


We have, in the preceding exercises, seen that certain specific sequences 
converge. It's time to develop some general theory, something that will 
apply to lots of sequences, and something that will help us actually evaluate 
limits of certain sequences. 


A sequence {a,,} of real numbers is called nondecreasing if 

Qn < Qn+1 for all n, and it is called nonincreasing if an > an+1 for 
all n. It is called strictly increasing if an < Q@n+1 for all n, and strictly 
decreasing if an > Gn+1 for all n. 


A sequence {a,,} of real numbers is called eventually nondecreasing if 
there exists a natural number N such that an < an41 foralln > N, 
and it is called eventually nonincreasing if there exists a natural 
number NV such that a, > a@n4+41 for alln > N. We make analogous 
definitions of “eventually strictly increasing” and “eventually strictly 


decreasing.” 


It is ordinarily very difficult to tell whether a given sequence converges or 
not; and even if we know in theory that a sequence converges, it is still 
frequently difficult to tell what the limit is. The next theorem is therefore 
very useful. It is also very fundamental, for it makes explicit use of the 
existence of a least upper bound. 


Let {a,,} be a nondecreasing sequence of real numbers. Suppose that the set 
S of elements of the sequence {a,,} is bounded above. Then the sequence 
{an} is convergent, and the limit D is given byL =sup S =sup ay. 


Analogously, if {a,,} is a nonincreasing sequence that is bounded below, 
then {a,,} converges to inf ay. 


We prove the first statement. The second is done analogously, and we leave 
it to an exercise. Write L for the supremum sup a,,. Let € be a positive 
number. By Theorem 1.5, there exists an integer NV such thatay > DL —e, 


which implies that L — ay < e. Since {a,,} is nondecreasing, we then 
have thata, > ay > L—e foralln > N. Since L is an upper bound for 
the entire sequence, we know that L > a,, for every n, and so we have that 
Equation: 


|\L—a,|= L-a, < LD—ay <e 


for all nm > N. This completes the proof of the first assertion. 
Exercise: 


Problem: 


a. Prove the second assertion of the preceding theorem. 

b. Show that [link] holds for sequences that are eventually 
nondecreasing or eventually nonincreasing. (Re-read the remark 
following the definition of the limit of a sequence.) 


The next exercise again demonstrates the “denseness” of the rational and 
irrational numbers in the set F of all real numbers. 
Exercise: 


Problem: 


a. Let x be a real number. Prove that there exists a sequence {r,, } of 
rational numbers such that x =lim 7r,,. In fact, show that the 
sequence {r,,} can be chosen to be nondecreasing. HINT: For 
example, for each n, use [link] to choose a rational number r,, 
between x — 1/nandz. 

b. Let x be a real number. Prove that there exists a sequence {r’,, } 
of irrational numbers such that 2 =lim rj. 

c. Let z = x + ty be a complex number. Prove that there exists a 
sequence {a,} = {8, + in} of complex numbers that 
converges to z, such that each @,, and each yz, is a rational 
number. 


Exercise: 


Problem: 


Suppose {a,,} and {b,,} are two convergent sequences, and suppose 
that lim a, = a and lim b, = b. Prove that the sequence {a,, + b,} is 
convergent and that 

Equation: 


lim (a, +b,) =a+b. 


HINT: Use an €/2 argument. That is, choose a natural number NV; so 
that ja, — a| < €/2 for all m > Nj, and choose a natural number N»2 
so that |b, — b| < €/2 for all n > No. Then let N be the larger of the 
two numbers NV and No. 


The next theorem establishes the existence of four nontrivial and important 
limits. This time, the proofs are more tricky. Some clever idea will have to 
be used before we can tell how to choose the NV. 


1. Let z € C satisfy |z| < 1, and define a, = z”. then the sequence 
{a,} converges to 0. We write lim 2” = 0. 

2. Let b be a fixed positive number greater than 1, and define a,, = b!/”. 
See [link]. Then lim a,, = 1. Again, we write lim b!/" = 1. 

3. Let b be a positive number less than 1. Then lim p/n — 1, 

A. Ifa, = n'/”, then lim a, =lim n1/" = 1. 


We prove parts (1) and (2) and leave the rest of the proof to the exercise that 
follows. If z = 0, claim (1) is obvious. Assume then that z ~ 0, and let 

€ > 0 be given. Let w = 1/|z|, and observe that w > 1. So, we may write 
w = 1+h for some positive h. (That step is the clever idea for this 
argument.) Then, using the Binomial Theorem, w” > nh, and so 

1/w” < 1/ (nh). See part (a) of [link]. But then 

Equation: 


l2” — 0| =|2"|= [z|" = (1/w)” = 1/w" < 1/ (nh). 


So, if N is any natural number larger than 1/(€h), then 
Equation: 


"0 =B=lal"< > Sap < 
Ae AI Ce Gree fp 


for all m > N. This completes the proof of the first assertion of the 
theorem. 


To see part (2), writea, = b'/" =1+4+ ay, ie, 2, = b'/" — 1, and 
observe first that x, > 0. Indeed, since b > 1, it must be that the nth root 
b1/" is also > 1. (Why?) Therefore, x, = b!/" — 1 > 0. (Again, writing 
b!/” as 1 + ay is the clever idea.) Now, b = b!/"" = (1+ 2p)”, which, 
again by the Binomial Theorem, implies that b > 1+ nay. So, 
Ln < (b—1)/n, and therefore 
Equation: 

b—1 


bv" _1/=b/"_1=29”, < —— <e 
n 


whenever n > €/(b — 1), and this proves part (2). 
Exercise: 


Problem: 


a. Prove part (3) of the preceding theorem. HINT: For 6 < 1, use the 
following algebraic calculation: 
Equation: 


lov" — 1) = on — (1/8) I< | — 0/0)", 


and then use part (2) as applied to the positive number 1/b. 

b. Prove part (4) of the preceding theorem. Explain why it does not 
follow directly from part (2). HINT: Write n!/”" = 1+ hy. 
Observe that h,, > 0. Then use the third term of the binomial 
theorem in the expansion n = (1+ hn)”. 


c. Construct an alternate proof to part (2) of the preceding theorem 
as follows: Show that the sequence {ol o is nonincreasing and 
bounded below by 1. Deduce, from [link], that the sequence 
converges to a number L. Now prove that Z must be 1. 


Definition of e 
The theorem of the definition of e is introduced and proven. 


Part (4) of [link] raises an interesting point. Suppose we have a sequence 
{an}, like {n}, that is diverging to infinity, and suppose we have another 
sequence {b,}, like {1/n}, that is converging to 0. What can be said about 
the sequence {abn \? The base ay, is blowing up, while the exponent by, is 
going to 0. In other words, there are two competing processes going on. If 
Qn is blowing up, then its powers ought to be blowing up as well. On the 
other hand, anything to the 0 power should be 1, so that, as the exponents of 
the elements of a sequence converge to 0, the sequence ought to converge to 
1. This competition between the convergence of the base to infinity and the 
convergence of the exponent to 0 makes it subtle, if not impossibly difficult, 
to tell what the combination does. For the special case of part (4) of [link], 
the answer was 1, indicating that, in that case at least, the exponents going 
to 0 seem to be more important than the base going to infinity. One can 


think up all kinds of such examples: {ary b dean h deny}, 


and so on. We will see later that all sorts of things can happen. 


Of course there is the reverse situation. Suppose {a,, } is a sequence of 
numbers that decreases to 1, and suppose {b,,} is a sequence of numbers 
that diverges to infinity. What can we say about the sequence {an was The 
base is tending to 1, so that one might expect that the whole sequence also 
would be converging to 1. On the other hand the exponents are blowing up, 
so that one might think that the whole sequence should blow up as well. 
Again, there are lots of examples, and they don't all work the same way. 
Here is perhaps the most famous such example. 

Definition of e. 


For n > 1, define an = (1 + 1/n)”. Then the sequence {a, } is 
nondecreasing and bounded above, whence it is convergent. (We will 
denote the limit of this special sequence by the letter e.) 


To see that {a,,} is nondecreasing, it will suffice to prove that an+1/an > 1 
for all n. In the computation below, we will use the fact (part (c)of [link]) 
that if 2 > —1 then (1+ 2)” > 1+ nz. So, 


Equation: 


An+1 _ ( 
rcl —— aa aco ae 
An (1+ +) 


n+1 Le 
> = 1-m+n(=)) 
— nti i 1 ) 

n n+1 
—_ n+i1in 
7 n n+l 
= 1, 


as desired. 


We show next that {a,,} is bounded above. This time, we use the binomial 
theorem, the geometric progression, and [link]. 
Equation: 


rclan 


as desired. 


That the sequence {a,,} converges is now a consequence of [link]. 


REMARK We have now defined the real number e. Its central role in 
mathematics is not at all evident yet; at this point we have no definition of 
exponential function, logarithm, or trigonometric functions. It does follow 
from the proof above that e is between 2 and 4, and with a little more 
careful estimates we can show that actually e < 3. For the moment, we will 
omit any further discussion of its precise value. Later, in [link], we will 


show that it is an irrational number. 


Properties of Convergent Sequences 

A discussion of the properties of convergent sequences, such as when they 
form a bounded set, the squeeze theorem, and the properties of convergence 
when sequences are combined algebraically. 


Often, our goal is to show that a given sequence is convergent. However, as 
we study convergent sequences, we would like to establish various 
properties that they have in common. The first theorem of this section is just 
such a result. 


Suppose {a,,} is a convergent sequence of real or complex numbers. Then 
the sequence {a,,} forms a bounded set. 


Write L =lim a,. Let € be the positive number 1. Then, there exists a 
natural number NV such that |a, — L| < 1 for all nm > N. By the backward 
triangle inequality, this implies that ||a,,|—|Z|| < 1 for alln > N, which 
implies that |a,|< |Z| + 1 for alln > N. This shows that at least the tail of 
the sequence is bounded by the constant |Z| + 1. 


Next, let K be a number larger than the finitely many numbers 

|a1|, .--, |@v—1|. Then, for any n,|a@,| is either less than K or |Z| + 1. Let 
M be the larger of the two numbers K and |L| + 1. Then |a,,|< M for all 
n. Hence, the sequence {a,,} is bounded. 


Note that the preceding theorem is a partial converse to [link]; i.e., a 
convergent sequence is necessarily bounded. Of course, not every 
convergent sequence must be either nondecreasing or nonincreasing, so that 


a full converse to [link] is not true. For instance, take z = —1/2 in part (1) 
of [link]. It converges to 0 all right, but it is neither nondecreasing nor 
nonincreasing. 
Exercise: 

Problem: 


a. Suppose {a,,} is a sequence of real numbers that converges to a 
number a, and assume that a, > c for all n. Prove that a > c. 
HINT: Suppose not, and let € be the positive number c — a. Let 


N be a natural number corresponding to this choice of €, and 
derive a contradiction. 

b. If {a} is a sequence of real numbers for which lim a, = a, and 
if a ~ 0, then prove that a,, ~ 0 for all large enough n. Show in 
fact that there exists an N such that |a,,|> |a|/2 for alln > N. 
HINT: Make use of the positive number € = |a|/2. 


Exercise: 


Problem: 


a. If {a,,} is a sequence of positive real numbers for which 
lim a, = a > 0, prove that lim ,/a, = \/a. HINT: Multiply the 
expression ,/a;, — »/a above and below by ,/an + V/a. 

b. If {a,,} is a sequence of complex numbers, and lim a, = a, 
prove that lim|a,,|= |a|. HINT: Use the backward triangle 
inequality. 


Exercise: 
Problem: 
Suppose {a,,} is a sequence of real numbers and that L =lim a,. Let 


MM, and M, be real numbers such that M1 < a, < Mo for all n. Prove 
that M, < L < Mo. 


HINT: Suppose, for instance, that L > Mp. Make use of the positive 
number L — Mz to derive a contradiction. 


We are often able to show that a sequence converges by comparing it to 
another sequence that we already know converges. The following exercise 
demonstrates some of these techniques. 

Exercise: 


Problem: Let {a,,} be a sequence of complex numbers. 


a. Suppose that, for each n,|ay|< 1/n. Prove that 0 =lim ap. 


b. Suppose {b,,} is a sequence that converges to 0, and suppose that, 
for each n,|ay, |<|b,|. Prove that 0 =lim ay. 


The next result is perhaps the most powerful technique we have for 
showing that a given sequence converges to a given number. 
Squeeze Theorem 


Suppose that {a,,} is a sequence of real numbers and that {b,,} and {c,,} are 
two sequences of real numbers for which 6, < a, < c, for all n. Suppose 
further that lim b,, =lim c,, = DL. Then the sequence {a,,} also converges 
to L. 


We examine the quantity |a,, — L, | employ some add and subtract tricks, 
and make the following computations: 


Equation: 
rclja, —L| < |a,—b,+6,—L| 
<  |an — bp |+b, — L| 
= an—b,+|b,—L| 
< Cy — bat |b, — L| 
= |c, — b,|+|b, — L| 
< le, — D| + |L—b,|+|b, — LD}. 


So, we can make ja, — L| < € by making |c,, — L| < €/3 and 

|b, — L| < €/3. So, let Nj be a positive integer such that |c, — L| < €/3 if 
n > Ni, and let N2 be a positive integer so that |b, — L| < €/3 ifn > No. 
Then set NV =max (Nj, No). Clearly, ifn > N, then both inequalities 

|Cn — L| < €/3 and |b, — L| < €/3, and hence |a, — L| < e. This finishes 
the proof. 


The next result establishes what are frequently called the “limit theorems.” 
Basically, these results show how convergence interacts with algebraic 
operations. 


Let {a,} and {b,,} be two sequences of complex numbers with a =lim a, 
and b =lim b,,. Then 


1. The sequence {a,, + b,,} converges, and 
Equation: 


lim (ay + bp) =lim a,+ lim b, = a + b. 


2. The sequence {a,,b,,} is convergent, and 
Equation: 


lim (anb,) =lim a, lim b, = ab. 


3. If all the b,'s as well as b are nonzero, then the sequence {a,,/b,} is 
convergent, and 
Equation: 


i a, lima, _ a 
eS ee abe 


Part (1) is exactly the same as [link]. Let us prove part (2). 


By [link], both sequences {a,,} and {b,,} are bounded. Therefore, let / be 
a number such that |a,|< M and |b,|< M for all n. Now, let € > 0 be 
given. There exists an N; such that ja, — a| < €/ (2M) whenever n > Mi, 
and there exists an N2 such that |b, — b| < €/ (2M) whenever n > No. Let 
N be the maximum of N, and N». Here comes the add and subtract trick 
again. 

Equation: 


|Qnbp — aby, + ab, — abl 
< |anbp — ab,|+|ab, — abl 
= |a@n — al|bn|+|a||b—b,| 

la, —a|M + Mb, — d| 


€ 


rcl|anby — ab| 


qs 


N\A 


if n > N, which shows that lim (a,b,) = ab. 


To prove part (3), let V7 be as in the previous paragraph, and let e > 0 be 
given. There exists an NV, such that lan — al < (eb|”)/ (4M) whenever 
n > Nj; there also exists an Ny such that ln — b| < (e|b|”) / (4M) 


whenever n > No; and there exists an V3 such that |b, |> |b] /2 whenever 
n > Ns. (See [link].) Let NV be the maximum of the three numbers N,, N2 


and N3. Then: 
Equation: 
rel a a anb — ba 
b,  b b,b 
= |a,b— a 
|bnb| 
< |anb — bya| : 
|b)" /2 
2 
< (lan — alld] + a||on — b|/)—5 
[2] 
Z 


<s€ 


ifn > N. This completes the proof. 


REMARK The proof of part (3) of the preceding theorem may look 
mysterious. Where, for instance, does this number e|b|? /4M come from? 
The answer is that one begins such a proof by examining the quantity 
|Gn/b, — a/b] to see if by some algebraic manipulation one can discover 
how to control its size by using the quantities |a,, — a| and |b,, — b|. The 
assumption that a =lim a, and b =lim b, mean exactly that the quantities 
|@, — a| and |b,, — b| can be controlled by requiring n to be large enough. 
The algebraic computation in the proof above shows that 

Equation: 


An a 2 
Be GIS Ale — al + MPa — BD 


and one can then see exactly how small to make |a,, — a| and |b, — b| so 
that |a,,/b, — a/b| < e. Indeed, this is the way most limit proofs work. 
Exercise: 


Problem: 


If possible, determine the limits of the following sequences by using 
[link], [link], [link], and the squeeze theorem [link]. 


< (1 — i HINT: Note that 
Equation: 


ot 1 1 1 
Sg eee 


i {(1=1/ (2n))™ 


j. {(nynh 


Subsequences and Cluster Points 

An introduction of the convergence of subsequences, the bolzano- 
weierstrass theorem, cluster sets, suprema, infima, and the catchy criterion. 
Multiple exercises and proofs are included. 


Let {a,,} be a sequence of real or complex numbers. A subsequence of 
{an} is a sequence {b;,} that is determined by the sequence {a,, } 
together with a strictly increasing sequence {n,} of natural numbers. 
The sequence {b;,} is defined by by, = an,. That is, the kth term of the 
sequence {b;,} is the nzth term of the original sequence {a,, }. 


Exercise: 


Problem: 


Prove that a subsequence of a subsequence of {a,,} is itself a 
subsequence of {a,,}. Thus, let {a,,} be a sequence of numbers, and 
let {0} = {an,} be a subsequence of {a,}. Suppose {cj} = {bz, } is 
a subsequence of the sequence {b;,}. Prove that {c,;} is a subsequence 
of {a,,}. What is the strictly increasing sequence {m,} of natural 
numbers for which cj = @m,? 


Here is an interesting generalization of the notion of the limit of a sequence. 


Let {a,,} be a sequence of real or complex numbers. A number z is 
called a cluster point of the sequence {a,,} if there exists a 
subsequence {b;} of {a,,} such that 2 =lim b,x. The set of all cluster 
points of a sequence {a,, } is called the cluster set of the sequence. 


Exercise: 


Problem: 


a. Give an example of a sequence whose cluster set contains two 
points. Give an example of a sequence whose cluster set contains 
exactly n points. Can you think of a sequence whose cluster set is 
infinite? 


b. Let {a,,} be a sequence with cluster set S. What is the cluster set 
for the sequence {—a,,}? What is the cluster set for the sequence 
{anj? 

c. If {b,, } is a sequence for which b =lim b,,, and {a,,} is another 
sequence, what is the cluster set of the sequence {a,,b,,}? 

d. Give an example of a sequence whose cluster set is empty. 

e. Show that if the sequence {a,,} is bounded above, then the cluster 
set S is bounded above. Show also that if {a,,} is bounded below, 
then S' is bounded below. 

. Give an example of a sequence whose cluster set S' is bounded 
above but not bounded below. 

g. Give an example of a sequence that is not bounded, and which 

has exactly one cluster point. 


= 


Suppose {a,,} is a sequence of real or complex numbers. 


1. (Uniqueness of limits) Suppose lim a, = L, and lim a, = M. Then 
L = M. That is, if the limit of a sequence exists, it is unique. 

2. If L =lim a, and if {b,x} is a subsequence of {a,,}, then the 
sequence {b;,} is convergent, and lim b; = L. That is, if a sequence 
has a limit, then every subsequence is convergent and converges to that 
same limit. 


Suppose lim a,, = Land lim a, = M. Let € be a positive number, and 
choose N; so that ja, — L| < €/2 ifn > Nj, and choose Ny so that 

Jan — M| < €/2 ifn > No. Choose an n larger than both N;andN2. Then 
Equation: 


|Z — M| = |L—a, +a, — M| < |L—ay |+lan — M| < e. 
Therefore, since |Z — M| < e for every positive €, it follows that 
L— M = 0or L = M. This proves part (1). 


Next, suppose lim a,, = L and let {b,} be a subsequence of {a,,}. We 
wish to show that lim b, = LD. Let € > 0 be given, and choose an N such 


that |a, — L| < €ifn > N. Choose a K so that nx > N. (How?) Then, if 
k > K, we have ng > nx > N, whence |b, — L| =|an, — L| < €, which 
shows that lim 6; = L. This proves part (2). 


REMARK The preceding theorem has the following interpretation. It says 
that if a sequence converges to a number JL, then the cluster set of the 
sequence contains only one number, and that number is L. Indeed, if x is a 
cluster point of the sequence, then there must be some subsequence that 
converges to x. But, by part (2), every subsequence converges to L. Then, 
by part (1), « = L. Part (g) of [link] shows that the converse of this 
theorem is not valid. that is, the cluster set may contain only one point, and 
yet the sequence is not convergent. 


We give next what is probably the most useful fundamental result about 
sequences, the Bolzano-Weierstrass Theorem. It is this theorem that will 
enable us to derive many of the important properties of continuity, 
differentiability, and integrability. 

Bolzano-Weierstrass 


Every bounded sequence {a,,} of real or complex numbers has a cluster 
point. In other words, every bounded sequence has a convergent 
subsequence. 


The Bolzano-Weierstrass Theorem is, perhaps not surprisingly, a very 
difficult theorem to prove. We begin with a technical, but very helpful, 
lemma. 


Let {a,,} be a bounded sequence of real numbers; i.e., assume that there 
exists an M such that |a,,|< M for all n. For each n > 1, let S,, be the set 
whose elements are {Qn, Q@n41, 4n+2,---}. That is, S,, is just the elements of 
the tail of the sequence from n on. Define z, =sup S, =sup;,s,, az. Then 


1. The sequence {z,,} is bounded (above and below). 

2. The sequence {x,, } is non-increasing. 

3. The sequence {z,, } converges to a number z. 

4. The limit x of the sequence {x,,} is a cluster point of the sequence 
{a,}. That is, there exists a subsequence {b;} of the sequence {a,, } 


that converges to x. 

5. If y is any cluster point of the sequence {a,,}, then y < a, where z is 
the cluster point of part (4). That is, x is the maximum of all cluster 
points of the sequence {a, }. 


Since x, is the supremum of the set S,,, and since each element of that set 
is bounded between —M and M, part (1) is immediate. 


Since S,,,1 C S,,, it is clear that 
Equation: 


Lni1 =Sup Syi1 <sup Sp = Xn, 


showing part (2). 


The fact that the sequence {z,, } converges to a number z is then a 
consequence of [link]. 


We have to show that the limit x of the sequence {z,,} is a cluster point of 
{an}. Notice that {x,, } may not itself be a subsequence of {a,,}, each x, 
may or may not be one of the numbers ax, so that there really is something 
to prove. In fact, this is the hard part of this lemma. To finish the proof of 
part (4), we must define an increasing sequence {n;,} of natural numbers 
for which the corresponding subsequence {b;} = {an,} of {a,} converges 
to x. We will choose these natural numbers {nx} so that |r—an,|< 1/k. 
Once we have accomplished this, the fact that the corresponding 
subsequence 1t converges to x will be clear. We choose the n;'s 
inductively. First, using the fact that x =lim z,,, choose an n so that 

Zn — 2| = 2, — x < 1/1. Then, because x,, =sup S,, we may choose 
by [link] some m > n such that 2, > @m > 2, — 1/1. But then 

|Qm — £| < 1/1. (Why?) This m we call n,. We have that 

lan, — #| < 1/1. 


Next, again using the fact that x =lim z,,, choose another n so that n > n, 
and so that |v, — x| = 2, — x < 1/2. Then, since this x, =sup S,, we 


may choose another m > n such that 2, > @m > £, — 1/2. This m we 
call ng. Note that we have |an, — | < 1/2. 


Arguing by induction, if we have found an increasing set 

Ny < Ng <...< nj, for which |a,, — z| < 1/i for 1 <i < j, choose ann 
larger than n, such that |x, — z| < 1/(j +1). Then, since z,, =sup S_, 
choose anm > nso that 2, > Am > Lp —1/(j +1). Then 

lam — z| < 1/(j +1), and we let n;+1 be this m. It follows that 


lang, — 2] <1/(§ +2). 


So, by recursive definition, we have constructed a subsequence of {a,, } that 
converges to z, and this completes the proof of part (4) of the lemma. 


Finally, if y is any cluster point of {a,,}, and if y =lim a,,,, then n; > k, 
and so adn, < xx, implying that x, — an, > 0. Hence, taking limits on k, 


we see that z — y > O, and this proves part (5). 


Now, using the lemma, we can give the proof of the Bolzano-Weierstrass 
Theorem. 


If {a,,} is a sequence of real numbers, this theorem is an immediate 
consequence of part (4) of the preceding lemma. 


If An = by + Cnt is a sequence of complex numbers, and if {a,,} is 
bounded, then {b,,} and {c,,} are both bounded sequences of real numbers. 
See [link]. So, by the preceding paragraph, there exists a subsequence {b,, } 
of {b,,} that converges to a real number b. Now, the subsequence {Cp, } is 
itself a bounded sequence of real numbers, so there is a subsequence 


{en} that converges to a real number c. By part (2) of [link], we also 
have that the subsequence {bn,,} converges to b. So the subsequence 
{an.,$ = { bn, + cn, i} of {a,,} converges to the complex number 


b + ci; i.e., {an} has a cluster point. This completes the proof. 


There is an important result that is analogous to the Lemma above, and its 
proof is easily adapted from the proof of that lemma. 


Exercise: 


Problem: 


Let {a,} be a bounded sequence of real numbers. Define a sequence 
{yn} by Yn = infpsnag. Prove that: 


a. {y,} is nondecreasing and bounded above. 

b. y =lim yr, is a cluster point of {a,, }. 

c. If z is any cluster point of {a,,}, then y < z. That is, y is the 
minimum of all the cluster points of the sequence {a,,}. HINT: 
Let {a,,} = {—a,}, and apply the preceding lemma to {a,, }. 
This exercise will then follow from that. 


The Bolzano-Wierstrass Theorem shows that the cluster set of a bounded 
sequence {a,,} is nonempty. It is also a bounded set itself. 


The following definition is only for sequences of real numbers. However, 
like the Bolzano-Weierstrass Theorem, it is of very basic importance and 
will be used several times in the sequel. 


Let {a,,} be a sequence of real numbers and let denote its cluster set. 


If S is nonempty and bounded above, we define lim sup a, to be the 
supremum sup S of S. 


If S is nonempty and bounded below, we define lim inf a, to be the 
infimum infS of S. 


If the sequence {a,,} of real numbers is not bounded above, we define 
lim sup a, to be ov, and if {a,,} is not bounded below, we define 
lim inf a,, to be —oo. 


If {a,,} diverges to oo, then we define lim sup a,, and lim inf a, 
both to be oo. And, if {a,} diverges to —co, we define lim sup ay, 
and lim inf a,, both to be —oo. 


We call lim sup a,, the limit superior of the sequence {a,,}, and 
lim inf a,, the limit inferior of {a,}. 


Exercise: 


Problem: 


a. Suppose {a,,} is a bounded sequence of real numbers. Prove that 
the sequence {z,,} of the lemma following [link] converges to 
lim sup ay. Show also that the sequence {y,,} of [link] 
converges to lim inf ay. 

b. Let {a,,} be a not necessarily bounded sequence of real numbers. 


Prove that 
Equation: 
lim sup a, =infsup az, =limsup ag. 
nm k>n nm k>n 
and 
Equation: 


lim inf a, =supinf a, =liminf k > nag. 


HINT: Check all cases, and use [link] and [link]. 
c. Let {a,,} be a sequence of real numbers. Prove that 
Equation: 


lim sup a, = — lim inf (—a,). 


d. Give examples to show that all four of the following possibilities 


can happen. 
1. lim sup ay is finite, and lim inf a, = —oo. 
2. lim sup ay, = oo and lim inf a, is finite. 
3. lim sup a, = oo and lim inf a, = —oo. 


4. both lim sup a,, and lim inf a,, are finite. 


The notions of limsup and liminf are perhaps mysterious, and they are in 
fact difficult to grasp. The previous exercise describes them as the resultof a 
kind of two-level process, and there are occasions when this description is a 
great help. However, the limsup and liminf can also be characterized in 
other ways that are more reminiscent of the definition of a limit. These 
other ways are indicated in the next exercise. 

Exercise: 


Problem: Let {a,,} be a bounded sequence of real numbers with 


lim sup a, = Land lim inf a, = I. Prove that L and I satisfy the 
following properties. 


a. For each € > 0, there exists an NV such that a, < DL + e for all 
n > N. HINT: Use the fact that lim sup a, = L is the number x 
of the lemma following Theorem 2.8, and that x is the limit of a 
specific sequence {2 }. 

b. For each € > OQ, and any natural number k, there exists a natural 

number j > k such that a; > L — e. Same hint as for part (a). 

c. For each € > O, there exists an N such that a, > 1 — ¢ for all 

n>WN. 

d. For each € > 0, and any natural number k, there exists a natural 

number 7 > k such thata; <1+ e. 

e. Suppose L’ is a number that satisfies parts (a) and (b). Prove that 
L’ is the limsup of {a,,}. HINT: Use part (a) to show that L’ is 
greater than or equal to every cluster point of {a,,}. Then use part 
(b) to show that L’ is less than or equal to some cluster point. 

. If I’ is any number that satisfies parts (c) and (d), show that I’ is 
the liminf of the sequence {a, }. 


ear} 


Exercise: 
Problem: 
a. Let {a,,} and {b,,} be two bounded sequences of real numbers, 


and write L =lim sup a, and M =lim sup 6,,. Prove that 
lim sup (a, + 6,) <lim sup a,+ lim sup b,. HINT: Using 


part (a) of the preceding exercise, show that for every € > 0 there 
exists a N such thata, + 6, < L+M-+eforalln > N, and 
conclude from this that every cluster point y of the sequence 
{an + b,} is less than or equal to L + M. This will finish the 
proof, since lim sup (a,, + 0,,) is a cluster point of that sequence. 
b. Again, let {a,,} and {b,,} be two bounded sequences of real 
numbers, and write 1 —lim inf a,, and m =lim inf b,,. Prove that 
lim inf (a, + b,) >lim inf a,+ lim inf b,. HINT: Use part (c) 
of the previous exercise. 
c. Find examples of sequences {a,,} and {b,,} for which 
lim sup a, =lim sup b, = 1, but lim sup (a, + 6,,) = 0. 


We introduce next another property that a sequence can possess. It looks 
very like the definition of a convergent sequence, but it differs in a crucial 
way, and that is that this definition only concerns the elements of the 
sequence {a,,} and not the limit LD. 


A sequence {a,,} of real or complex numbers is a Cauchy sequence if 
for every € > 0, there exists a natural number NV such that ifn > N 
and m > N then jan — am|< e. 


REMARK No doubt, this definition has something to do with limits. Any 
time there is a positive € and an NV, we must be near some kind of limit 
notion. The point of the definition of a Cauchy sequence is that there is no 
explicit mention of what the limit is. It isn't that the terms of the sequence 
are getting closer and closer to some number J, it's that the terms of the 
sequence are getting closer and closer to each other. This subtle difference 
is worth some thought. 

Exercise: 


Problem: 


Prove that a Cauchy sequence is bounded. (Try to adjust the proof of 
[link] to work for this situation.) 


The next theorem, like the Bolzano-Weierstrass Theorem, seems to be quite 
abstract, but it also turns out to be a very useful tool for proving theorems 
about continity, differentiability, etc. In the proof, the completeness of the 
set of real numbers will be crucial. This theorem is not true in ordered fields 
that are not complete. 

Cauchy Criterion 


A sequence {a,,} of real or complex numbers is convergent if and only if it 
is a Cauchy sequence. 


If lima, = a then given € > 0, choose N so that |ax — a] < €/2 if 

k > N. From the triangle inequality, and by adding and subtracting a, we 
obtain that |a,, — a,,|< «ifn > Nandm > N. Hence, if {a,, } is 
convergent, then {a,,} is a Cauchy sequence. 


Conversely, if {a,,} is a cauchy sequence, then {a,,} is bounded by the 
previous exercise. Now we use the fact that {a,, } is a sequence of real or 
complex numbers. Let x be a cluster point of {a,,}. We know that one 
exists by the Bolzano-Weierstrass Theorem. Let us show that in fact this 
number z not only is a cluster point but that it is in fact the limit of the 
sequence {a,,}. Given € > 0, choose Nso that |an, — @m|< €/2 whenever 
both n andm > N. Let {a,,} be a subsequence of {a,,} that converges to 
x. Because {n;,} is strictly increasing, we may choose a k so that nz, > N 
and also so that |a,, — 2| < €/2. Then, ifn > N, then both n and this 
particular n; are larger than or equal to NV. Therefore, 

|@n — 2| <|an — an, |+lan, — x| < e. this completes the proof that 

= iy Gj: 


A Little Topology 

We now investigate some properties that subsets of R and C’' may possess. 
We will define “closed sets,” “open sets,” and “limit points” of sets. These 
notions are the rudimentary notions of what is called topology. As in earlier 
definitions, these topological ones will be enlightening when we come to 
continuity. 


We now investigate some properties that subsets of R and C' may possess. 
We will define “closed sets,” “open sets,” and “limit points” of sets. These 
notions are the rudimentary notions of what is called topology. As in earlier 
definitions, these topological ones will be enlightening when we come to 
continuity. 


Let S be a subset of C’. A complex number z is called a limit point of 
S if there exists a sequence {z,,} of elements of S such that 
ee =i 3,5. 


A set S C C is called closed if every limit point of S belongs to S. 


Every limit point of a set of real numbers is a real number. Closed intervals 
[a, b] are examples of closed sets in R, while open intervals and half-open 


intervals may not be closed sets. Similarly, closed disks B, (c) of radius r 


around a point c in C, and closed neighborhoods NV, (.S) of radius r around 
a set S C C, are closed sets, while the open disks or open neighborhoods 
are not closed sets. As a first example of a limit point of a set, we give the 
following exercise. 

Exercise: 


Problem: 


Let S be a nonempty bounded set of real numbers, and let M =sup S. 
Prove that there exists a sequence {a,,} of elements of S' such that 

M =lim a,. That is, prove that the supremum of a bounded set of real 
numbers is a limit point of that set. State and prove an analogous result 
for infs. 


HINT: Use [link], and let € run through the numbers 1/n. 


Exercise: 


Problem: 


a. 


is 


Suppose § is a set of real numbers, and that z = a + bi € C' with 
b ~ 0. Show that z is not a limit point of S. That is, every limit 
point of a set of real numbers is a real number. HINT: Suppose 
false; write a + bt =lim z,,, and make use of the positive 
number |]. 


. Let c be a complex number, and let S' = B, (c) be the set of all 


z € C for which |z — c| < r. Show that S is a closed subset of C 
. HINT: Use part (b) of [link]. 


. Show that the open disk B,. (0) is not a closed set in C’' by finding 


a limit point of B, (0) that is not in B, (0). 


. State and prove results analogous to parts b and c for intervals in 


i. 


. Show that every element x of a set S is a limit point of S. 
. Let S be a subset of C’, and let x be a complex number. Show that 


x is not a limit point of S if and only if there exists a positive 
number e€ such that if |y — z| < e, then y is not in S. That is, 

SM B. (x) = 0. HINT: To prove the “ only if” part, argue by 
contradiction, and use the sequence {1/n} as e's. 


. Let {a,,} be a sequence of complex numbers, and let S'be the set 


of all the a,,'s. What is the difference between a cluster point of 
the sequence {a,,} and a limit point of the set S? 


. (h) Prove that the cluster set of a sequence is a closed set. HINT: 


Use parts (e) and (f). 


Exercise: 


Problem: 


a. 


b. 


Show that the set @ of all rational numbers is not a closed set. 
Show also that the set of all irrational numbers is not a closed set. 
Show that if S is a closed subset of R that contains Q, then S 
must equal all of R. 


Here is another version of the Bolzano-Weierstrass Theorem, this time 
Stated in terms of closed sets rather than bounded sequences. 


Let S be a bounded and closed subset of C’. Then every sequence {z,, } of 
elements of S' has a subsequence that converges to an element of S. 


Let {x,,} be a sequence in S. Since S is bounded, we know by [link] that 
there exists a subsequence {z,,, } of {x,,} that converges to some number 
x. Since each x, belongs to S, it follows that z is a limit point of S. 
Finally, because S is a closed subset of C’, it then follows that x € S. 


We have defined the concept of a closed set. Now let's give the definition of 
an open set. 


Let S be a subset of C’. A point x € S is called an interior point of S 
if there exists an € > 0 such that the open disk B, (x) of radius € 
around z is entirely contained in S. The set of all interior points of S is 
denoted by S° and we call S° the interior of S. 


A subset S of C’ is called an open subset of C' if every point of S is an 
interior point of S; i.e., if S = S°. 


Analogously, let S be a subset of R. A point x € S is called an 
interior point of S if there exists an € > O such that the open interval 
(x — €,x + €) is entirely contained in S. Again, we denote the set of 
all interior points of S by S° and call S° the interior of S. 


A subset S of R is called an open subset of R if every point of S is an 
interior point of S;i.e., if S = §°, 


Exercise: 
Problem: 
a. Prove that an open interval (a, b) in R is an open subset of R; i.e., 
show that every point of (a, b) is an interior point of (a, b). 


b. Prove that any disk B, (c) is an open subset of C’. Show also that 
the punctured diskB;, (c) is an open set, where 


BY! (c) = {z:0< |z—c| < r}, ie. evrything in the disk B, (c) 
except the central point c. 

c. Prove that the neighborhood NV, (.S) of radius r around a set S is 
an open subset of C’. 

d. Prove that no nonempty subset of R is an open subset of C’. 

e. (e) Prove that the set @ of all rational numbers is not an open 
subset of R. We have seen in part (a) of [link] that Q is not a 
closed set. Consequently it is an example of a set that is neither 
open nor closed. Show that the set of all irrational numbers is 
neither open nor closed. 


We give next a useful application of the Bolzano-Weierstrass Theorem, or 
more precisely an application of [link]. This also provides some insight into 
the structure of open sets. 


Let S be a closed and bounded subset of C’, and suppose S is a subset of an 
open set U. Then there exists an r > O such that the neighborhood JN, (S) 
is contained in U. That is, every open set containing a closed and bounded 
set S actually contains a neighborhood of S. 


If Sis just a singleton {x}, then this theorem is asserting nothing more than 
the fact that x is in the interior of U, which it is if U is an open set. 
However, when S is an infinite set, then the result is more subtle. We argue 
by contradiction. Thus, suppose there is no such r > 0 for which 

N,. (S') © U. then for each positive integer n there must be a point z,, that 
is not in U, and a corresponding point y,, € S, such that |7, — yn|< 1/n. 
Otherwise, the number r = 1/n would satisfy the claim of the theorem. 
Now, because the y,,'s all belong to S, we know from [link] that a 
subsequence {yn, } of the sequence {y,,} must converge to a number 

y € S. Next, we see that 

Equation: 


1 
C59 Slee ne Te 8 a + |Y¥n, — yl 


and this quantity tends to 0. Hence, the subsequence {2,, } of the sequence 
{x,,} also converges to y. 


Finally, because y belongs to S and hence to the open set U, we know that 
there must exist an € > O such that the entire disk B, (y) C U. Then, since 
the subsequence {z,,, } converges to y, there must exist an; such that 

|Zn, — y| < €, implying that z,, € B, (y), and hence belongs to U. But 
this is our contradiction, because all of the z,,'s were not in U. So, the 
theorem is proved. 


We give next a result that clarifies to some extent the connection between 
open sets and closed sets. Always remember that there are sets that are 
neither open nor closed, and just because a set is not open does not mean 
that it is closed. 


A subset S' of C (R) is open if and only if its complement S = C \ S( 
R \ S) is closed. 


First, assume that S is open, and let us show that S is closed. Suppose not. 
We will derive a contradiction. Suppose then that there is a sequence {x} 


of elements of § that converges to a number z that is not in S; Le., zis an 
element of S. Since every element of S is an interior point of 3 , there must 
exist an € > 0 such that the entire disk B, (x) (or interval (2 — €, x + €)) is 
a subset of S. Now, since x =lim x, there must exist an such that 

Zn — 2| < € for every n > N. In particular, —2| <6 ie, 2y 
belongs to B, (x) (or (a — €, x + €)). This implies that xy € S. But 


INE 9 , and this is a contradiction. Hence, if S' is open, then S is closed. 


Conversely, assume that S$ is closed, and let us show that S must be open. 
Again we argue by contradiction. Thus, assuming that S is not open, there 
must exist a point x € S that is not an interior point of S. Hence, for every 
€ > 0 the disk B, (x) (or interval (a — €, x + €)) is not entirely contained 
in S. So, for each positive integer n, there must exist a point x, such that 
ham ¢ §. It follows then that x =lim z,,, and that each 


Zz, € S. Since § is a closed set, we must have that x € S.Butze S , and 


we have arrived at the desired contradiction. Hence, if Sis closed, then S' is 
open, and the theorem is proved. 


The theorem below, the famous Heine-Borel Theorem, gives an equivalent 
and different description of closed and bounded sets. This description is in 
terms of open sets, whereas the original definitions were interms of limit 
points. Any time we can find two very different descriptions of the same 
phenomenon, we have found something useful. 


Let S be a subset of C’' (respectively R). By an open cover of S we 
mean a sequence {U,,} of open subsets of C’' (respectively R) such that 
S C UU,; i.e., for every « € Sthere exists an n such that x € Uy. 


A subset S of C (respectively R) is called compact, or is said to satisfy 
the Heine-Borel property, if every open cover of S has a finite 
subcover. That is, if {U;,,} is an open cover of S, then there exists an 
integer N such that S C Belay ei n- In other words, only a finite number 
of the open sets are necessary to cover S. 


REMARK The definition we have given here for a set being compact is a 
little less general from the one found in books on topology. We have 
restricted the notion of an open cover to be a sequence of open sets, while 
in the general setting an open cover is just a collection of open sets. The 
distinction between a sequence of open sets and a collection of open sets is 
genuine in general topology, but it can be disregarded in the case of the 
topological spaces R and C. 

Heine-Borel Theorem 


A subset S of C (respectively R) is compact if and only if it is a closed and 
bounded set. 


We prove this theorem for subsets S of C’, and leave the proof for subsets of 
R to the exercises. 


Suppose first that S C C' is compact, i.e., satisfies the Heine-Borel 
property. For each positive integer n, define U,, to be the open set B,, (0). 
Then S C UU, because C' = UU,,. Hence, by the Heine-Borel property, 


there must exist an NV such that S C U/_,U,. But then S C By (0), 
implying that S is bounded. Indeed, |x| < N for all x € S. 


Next, still assuming that S' is compact, we will show that S is closed by 
showing that Sis open. Thus, let z be an element of S. For each positive 
integer n, define U;, to be the complement of the closed set By /,, (x). Then 
each U,, is an open set by Theorem 2.12, and we claim that {U,,} is an open 
cover of S. Indeed, if y € S, then y ~ x, and |y — z| > 0. Choose an n so 
that 1/n < |y— |. Then y ¢ By/,, (x), implying that y € U,. This proves 
our claim that {U,, } is an open cover of S. Now, by the Heine-Borel 
property, there exists an NV such that S' C U ade i n- But this implies that for 
every z € S we must have |z — x| > 1/N, and this implies that the disk 


By/n (2) is entirely contained in S. Therefore, every element x of S is an 


interior point of S. So, S is open, whence S$ is closed. This finishes the 
proof that compact sets are necessarily closed and bounded. 


Conversely, assume that S is both closed and bounded. We must show that 
S satisfies the Heine-Borel property. Suppose not. Then, there exists an 
open cover {U,,} that has no finite subcover. So, for each positive integer n 
there must exist an element x, € S for which x, ¢ U;_, Ux. Otherwise, 
there would be a finite subcover. By [link], there exists a subsequence 
{xn,} of {an} that converges to an element x of S. Now, because {U,,} is 


an open cover of S,, there must exist an N such that x € Uy. Because Uy 
is open, there exists an € > 0 so that the entire disk B, (x) is contained in 


Un. Since x —lim Zn,, there exists a J so that ie —a|<eif7> J. 
Therefore, if 7 > J, then z,, € Uy. But the sequence {n;} is strictly 
increasing, so that there exists a 7’ > J such thatn, > N, and by the 
choice of the point Tn, We know that Ln y g eg yz. We have arrived at a 
contradiction, and so the second half of the theorem is proved. 

Exercise: 


Problem: 


a. Prove that the union A U B of two open sets is open and the 
intersection A ™ B is also open. 


ms 


. Prove that the union A U B of two closed sets is closed and the 


intersection AM B is also closed. HINT: Use [link] and the set 


equations AU B= AN B,and AN B= AU B. These set 
equations are known as Demorgan's Laws. 


. Prove that the union A U B of two bounded sets is bounded and 


the intersection A M B is also bounded. 


. Prove that the union A U B of two compact sets is compact and 


the intersection AM B is also compact. 


. Prove that the intersection of a compact set and a closed set is 


compact. 


. Suppose S is a compact set in C' and r is a positive real number. 


Prove that the closed neighborhood N,, (S) of radius r around S 
is compact. HINT: To see that this set is closed, show that its 
coplement is open. 


Infinite Series 

Probably the most interesting and important examples of sequences are those that arise as 
the partial sums of an infinite series. In fact, it will be infinite series that allow us to 
explain such things as trigonometric and exponential functions. 


Probably the most interesting and important examples of sequences are those that arise as 
the partial sums of an infinite series. In fact, it will be infinite series that allow us to 
explain such things as trigonometric and exponential functions. 


Let {a,,}>° be a sequence of real or complex numbers. By the infinite series) * ay, 
we mean the sequence {Sy} defined by 
Equation: 


The sequence {Sy} is called the sequence of partial sums of the infinite series 
S* an, and the infinite series is said to be summable to a number S, or to be 
convergent, if the sequence {.S,v} of partial sums converges to S.The sum of an 
infinite series is the limit of its partial sums. 


An infinite series 5) a, is called absolutely summable or absolutely convergent if the 
infinite series 5 >|a,,| is convergent. 


If S> a, is not convergent, it is called divergent. If it is convergent but not absolutely 
convergent, it is called conditionally convergent. 


A few simple formulas relating the a,,'s and the S'y's are useful: 


Equation: 
Sn =ag9t+ai+ag+...+ayn, 
Equation: 
Snyi = Sw + any, 
and 
Equation: 


M 


Su — Sx = y Qn = 4k41 +aKi2+...+Am, 
n=K+1 


for M > K. 


REMARK Determining whether or not a given infinite series converges is one of the 
most important and subtle parts of analysis. Even the first few elementary theorems 
depend in deep ways on our previous development, particularly the Cauchy criterion. 


Let {a,,} be a sequence of nonnegative real numbers. Then the infinite series 5) a, is 
summable if and only if the sequence {Sy} of partial sums is bounded. 


If S$} a, is summable, then {Sv} is convergent, whence bounded according to [link]. 
Conversely, we see from the hypothesis that each a, > 0 that {Sy } is nondecreasing ( 
Swi1 = Sn + an+1 > Sy). So, if {Sy} is bounded, then it automatically converges by 
[link], and hence the infinite series 5+ a,, is summable. 


The next theorem is the first one most calculus students learn about infinite series. 
Unfortunately, it is often misinterpreted, so be careful! Both of the proofs to the next two 
theorems use [link], which again is a serious and fundamental result about the real 
numbers. Therefore, these two theorems must be deep results themselves. 


Let 5} a, be a convergent infinite series. Then the sequence {a,,} is convergent, and 
lim a, = 0. 


Because 5) a,, is summable, the sequence {Sy} is convergent and so is a Cauchy 
sequence. Therefore, given an € > 0, there exists an Np so that |S,, — Si,|< € whenever 
both n and m > No. Ifn > No, let m = n — 1. We have then that 

lan |=|Sn — Sm|< €, which completes the proof. 


REMARK Note that this theorem is not an “if and only if” theorem. The harmonic series 
(part (b) of [link] below) is the standard counterexample. The theorem above is mainly 
used to show that an infinite series is not summable. If we can prove that the sequence 
{a, } does not converge to 0, then the infinite series 5° a, does not converge. The 
misinterpretation of this result referred to above is exactly in trying to apply the (false) 
converse of this theorem. 


If $) ap is an absolutely convergent infinite series of complex numbers, then it is a 
convergent infinite series. (Absolute convergence implies convergence.) 


If {Sy} denotes the sequence of partial sums for }> a,,, and if {Ty} denotes the 
sequence of partial sums for >|a,,|, then 
Equation: 


M M 
Su — Sy || S- An|< S- an |=|Tu — Tn 


n=N-+1 n=N+1 


for all N and M. We are given that {Ty} is convergent and hence it is a Cauchy 
sequence. So, by the inequality above, {Sy} must also be a Cauchy sequence. (If 
Tv — Ty|< e, then |Sy — Sy|< € as well.) This implies that 5° a, is convergent. 
Exercise: 

The Infinite Geometric Series 


Problem: 
Let z be a complex number, and define a sequence {a,,} by a, = z”. Consider the 


infinite series 5} a,,. Show that pow Gn, converges to a number S if and only if 
|z| < 1. Show in fact that S = 1/(1 — z), when |z| < 1. 


HINT: Evaluate explicitly the partial sums Sy, and then take their limit. Show that 
Six _ t—2Nt 


Exercise: 


Problem: 
a. Show that °°, a0 converges to 1, by computing explicit formulas for the 
partial sums. HINT: Use a partial fraction decomposition for the a,,'s. 
b. (The Harmonic Series.) Show that }>°°_, 1/n diverges by verifying that 
Sx > k/2. HINT: Group the terms in the sum as follows, 
Equation: 


et ee a ee ee 
2 3. «4 5 6 7 8 9 10 ~ 16 ~ 


and then estimate the sum of each group. Remember this example as an infinite 
series that diverges, despite the fact that is terms tend to 0. 


The next theorem is the most important one we have concerning infinite series of 
numbers. 
Comparison Test 


Suppose {a,,} and {b,,} are two sequences of nonnegative real numbers for which there 
exists a positive integer MM and a constant C' such that b, < Ca, for alln > M. If the 
infinite series 5} a,, converges, so must the infinite series 5) by. 


We will show that the sequence {T'y } of partial sums of the infinite series 5) 6, is a 
bounded sequence. Then, by [link], the infinite series 5° b,, must be summable. 


Write Sy for the Nth partial sum of the convergent infinite series 5° a,,. Because this 
series is summable, its sequence of partial sums is a bounded sequence. Let B be a 


number such that Sy < B for all NV. We have for all N > M that 
Equation: 


rdIy = 


N 
= S> by, 
n=M+1 
M N 
< Sotnt+ SY) Can 
n=1 n=M+1 
M N 
= Soon tC An 
n=1 n=M-+1 


IA 
iMs 
: 
+ 
ee) 
M 

= 


IA 
iMs 
: 
+ 
SS 
KR 
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IA 
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+ 

Q 
irs 


which completes the proof, since this final quantity is a fixed constant. 
Exercise: 


Problem: 


a. Let {a,,} and {b,,} be as in the preceding theorem. Show that if 5) 6, diverges, 
then 5° a, also must diverge. 

b. Show by example that the hypothesis that the a,,'s and b,,'s of the Comparison 
Test are nonnegative can not be dropped. 


Exercise: 
The Ratio Test 


Problem: Let {a,, } be a sequence of positive numbers. 


a. If lim sup @n+4i1/an < 1, show that 5) a, converges. HINT: If 
lim sup @n41/@n = a < 1, let 8 be a number for which a < 8 < 1. Using 
part (a) of [link], show that there exists an NV such that for all mn > N we must 


have @ni1/@n < 8, or equivalently an+1 < Ban, and therefore ay, < Bray. 
Now use the comparison test with the geometric series 5) BY. 

b. If lim inf a,,41/an > 1, show that 5° a, diverges. 

c. As special cases of parts (a) and (b), show that {a,, } converges if 
limy Qn41/@n < 1, and diverges if lim, @n41/an > 1. 

d. Find two examples of infinite series' 5) a,, of positive numbers, such that 
lim a@n41/@n = 1 for both examples, and such that one infinite series converges 
and the other diverges. 


Exercise: 


Problem: 


a. Derive the Root Test: If {a,,} is a sequence of positive numbers for which 
lim sup ai/ "<1, then 5> a, converges. And, if lim inf ai! "> 1, then San 
diverges. 

b. Let r be a positive integer. Show that 5) 1/n” converges if and only if r > 2. 
HINT: Use [link] and the Comparison Test for r = 2. 

c. Show that the following infinite series are summable. 
Equation: 


$5 1/ (nr? +1), So 7n/2", S 5 a"/nl, 


for a any complex number. 


Exercise: 


Problem: 


Let {a,,} and {b,,} be sequences of complex numbers, and let {Sv} denote the 
sequence of partial sums of the infinite series 5° a,,. Derive the Abel Summation 
Formula: 

Equation: 


N N-1 
So anbn = Syby + S > Sp (bn — Bn st): 


The Comparison Test is the most powerful theorem we have about infinite series of 
positive terms. Of course, most series do not consist entirely of positive terms, so that the 
Comparison Test is not enough. The next theorem is therefore of much importance. 
Alternating Series Test 


Suppose {a1, @2, a3, ...} is an alternating sequence of real numbers; i.e., their signs 
alternate. Assume further that the sequence {|a,,| } is nonincreasing with 0 =lim|a,,|. 
Then the infinite series 5° a, converges. 


Assume, without loss of generality, that the odd terms a2,,+41 of the sequence {a,,} are 
positive and the even terms a2, are negative. We collect some facts about the partial sums 
Sy =a, +a2+... + ay of the infinite series 5° an. 


1. Every even partial sum Sg is less than the following odd partial sum 
Soni1 = Son + Qon+1, And every odd partial sum S2y +1 is greater than the 
following even partial sum Soy42 = Soen41 + Qon42. 

2. Every even partial sum Sy is less than or equal to the next even partial sum 
Sons2 = Son + Gon41 + Gon +2, implying that the sequence of even partial sums 
{Son} is nondecreasing. 

3. Every odd partial sum S2y+1 is greater than or equal to the next odd partial sum 
Son13 = Son+1 + Gon+2 + G2Nn +3, implying that the sequence of odd partial sums 
{.S2n+1} is nonincreasing. 

4. Every odd partial sum S2y7+1 is bounded below by S92. For, Sgvi1 > Son > So. 
And, every even partial sum Sx is bounded above by S}. For, Son < Sensi < S$}. 

5. Therefore, the sequence {S2, } of even partial sums is nondecreasing and bounded 
above. That sequence must then have a limit, which we denote by S_. Similarly, the 
sequence {.S2~+41} of odd partial sums is nonincreasing and bounded below. This 
sequence of partial sums also must have a limit, which we denote by So. 


Now 
Equation: 


56 = Se =lim Soni1— lim Son =lim (Son41 = Son) =lim Q2N41 = 0, 


showing that S, = S,, and we denote this common limit by S. Finally, given an € > 0, 
there exists an Ny so that |Sov — S| < €if 2N > Nj, and there exists an N2 so that 
|Son+1 — S| < €if 2N +1 > No. Therefore, if N >max (Nj, No), then 

|Siv — S| < e, and this proves that the infinite series converges. 

Exercise: 

The Alternating Harmonic Series 


Problem: 


a. Show that 5>°°_, (—1)"/n converges, but that it is not absolutely convergent. 
b. Let {a,,} be an alternating series, as in the preceding theorem. Show that the 

sum S = 5° a» is trapped between Sy and Sy+1, and that |S—Sy |<|ay]. 
c. State and prove a theorem about “eventually alternating infinite series.” 


d. Show that 5) z”/n converges if and only if |z| < 1, and z # 1. HINT: Use the 
Abel Summation Formula to evaluate the partial sums. 


Exercise: 


Problem: Let s = p/q be a positive rational number. 


a. For each x > 0, show that there exists a unique y > O such that y*® = 7; i.e., 
yo =i. 

b. Prove that 5) 1/n°* converges if s > 1 and diverges if s < 1. HINT: Group the 
terms as in part (b) of [link]. 


Test for Irrationality 


Let x be a real number, and suppose that {py /qy } is a sequence of rational numbers for 
which « =lim py /qy and « 4 py/qn for any N. If lim qy |x — py /qn| = 0, then x 
is irrational. 


We prove the contrapositive statement; i.e., if = p/q is a rational number, then 


lim qy |x — pw /qn| 4 0. We have 
Equation: 


dN a IPN 


xz — pn/qn = p/¢ — PN/GN 
qqNn 


Now the numerator pqy — gpw is not 0 for any N. For, if it were, then 
xz = p/q = pn/Qn, which we have assumed not to be the case. Therefore, since 
PqNn — Qpn is an integer, we have that 


Equation: 
PN — IPN 1 
|t—pw/qn |=|——_——|2 —_.. 
qaNn qqn| 
So, 
Equation: 


qn | — pw /qn| = jal 7° 


and this clearly does not converge to 0. 
Exercise: 


Problem: 


a. Let x = S09 (—1)"/2". Prove that z is a rational number. 


b. Let y = 53°, (—1)"/2”’. Prove that y is an irrational number. HINT: The 


n=0 
partial sums of this series are rational numbers. Now use the preceding theorem 
and part (b) of [link]. 


Functions and Continuity Definition of the Number tt 
An introduction, with links, to a subcollection about functions and 
continuity. 


The concept of a function is perhaps the most basic one in mathematical 
analysis. The objects of interest in our subject can often be represented as 
functions, and the “ unknowns” in our equations are frequently functions. 
Therefore, we will spend some time developing and understanding various 
kinds of functions, including functions defined by polynomials, by power 
series, and as limits of other functions. In particular, we introduce in this 
chapter the elementary transcendental functions. We begin with the familiar 
set theoretical notion of a function, and then move quickly to their 
analytical properties, specifically that of continuity. 


The main theorems of this chapter include: 


1. The Intermediate Value Theorem ((link]), 

2. the theorem that asserts that a continuous real-valued function on a 
compact set attains a maximum and minimum value ((link]), 

3. A continuous function on a compact set is uniformly continuous 
({link]), 

4. The Identity Theorem for Power Series Functions ((link]), 

5. The definition of the real number 

6. The theorem that asserts that the uniform limit of a sequence of 
continuous functions is continuous ((link]), and 

7. the Weierstrass  -Test ((link)). 


Functions 

A module about functions. Many terms, such as graph, real-valued, 
complex-valued, imaginary, bounded, even, odd, and others are defined. An 
exercise at the end involves some practice by proving statements and 
theorems related to these definitions of functions. 


Let S and T be sets. A function from S into T (notation f : S — T) is 
a rule that assigns to each element x in S a unique element denoted by 
f(xe) in: 


It is useful to think of a function as a mechanism or black box. We use 
the elements of S as inputs to the function, and the outputs are 
elements of the set 7’. 


If f : S > T is a function, then S is called the domain of f, and the 
set T’ is called the codomain of f. The range or image of f is the set of 
all elements y in the codomain 7’ for which there exists an x in the 
domain S such that y = f(a). We denote the range by f(.S). The 
codomain is the set of all potential outputs, while the range is the set of 
actual outputs. 


Suppose f is a function from a set S into a set 7’. If A C S, we write 
f(A) for the subset of T containing all the elements t € T for which 
there exists an s € A such that t = f(s). We call f(A) the image of A 
under f. Similarly, if B C T’, we write f~! (B) for the subset of S 
containing all the elements s € S such that f(s) € B, and we call the 
set f | (B) the inverse image or preimage of B. The symbol f~! (B) 
is a little confusing, since it could be misinterpreted as the image of the 
set B under a function called f~!. We will discuss inverse functions 
later on, but this notation is not meant to imply that the function f has 
an inverse. 


If f : S — T, then the graph of f is the subset G of the Cartesian 
product S x T' consisting of all the pairs of the form (x, f(x)). 


If f : S > Risa function, then we call f a real-valued function, and 
if f : S — C, then we call f a complex-valued function. If f : S > C 


is a complex-valued function, then for each x € S the complex 
number f(x) can be written as u(x) + iv(x), where u(x) and v(x) 
are the real and imaginary parts of the complex number f(x). The two 
real-valued functions u: S > Randv: S — Rare called 
respectively the real and imaginary parts of the complex-valued 
function f. 


If f: S > TandS C R, then f is called a function of a real variable, 
and if S C C, then f is called a function of a complex variable. 


If the range of f equals the codomain, then f is called onto. 


The function f : S — T is called one-to-one if f (x1) = f (x2) 
implies that 7; = 2. 


The domain of f is the set of x's for which f(z) is defined. If we are given 
a function f : S — T, we are free to regard f as having a smaller domain, 
ie., a subset S’ of S. Although this restricted function is in reality a 
different function, we usually continue to call it by the same name f. 
Enlarging the domain of a function, in some consistent manner, is often 
impossible, but is nevertheless frequently of great importance. The 
codomain of f is distinguished from the range of f, which is frequently a 
proper subset of the codomain. For example, since every real number is a 
complex number, any real-valued function f : S — R is also a (special 
kind of) complex-valued function. 


We consider in this book functions either of a real variable or of complex 
variable. that is, the domains of functions here will be subsets either of R or 
of C’. Frequently, we will indicate what kind of variable we are thinking of 
by denoting real variables with the letter x and complex variables with the 
letter z. Be careful about this, for this distinction is not always made. 


Many functions, though not all by any means, are defined by a single 
equation: 
Equation: 


y= 32 = 7, 


Equation: 


Equation: 


(How does this last equation define a function?) 
Equation: 


(1 —a?ylt)?? = (a/ (1 —y))*”. 


(How does this equation determine a function?) 


There are various types of functions, and they can be combined in a variety 
of ways to produce other functions. It is necessary therefore to spend a fair 
amount of time at the beginning of this chapter to present these definitions. 


If f and g are two complex-valued functions with the same domain S, 
ie, f: S— Candg: S — C, and if cis a complex number, we 
define f+ 9, fg, f/g (if g(a) is never 0), and cf by the familiar 


formulas: 
Equation: 
(f+ 9)(z) = f(x) + (2), 
Equation: 
(f9)(x) = flx)g(x), 
Equation: 


(f/9)(x) = flx)/9(2), 


and 
Equation: 


(cf)(«) = cf(z). 


If f and g are real-valued functions, we define functions max (f, g) 
and min (f, g) by 
Equation: 


[max (f, g)|(«) =max (f(x), 9(2)) 


(the maximum of the numbers f(z) and g(x)), and 
Equation: 


[min (f, 9)](«) =min (f(x), 9(2)), 


(the minimum of the two numbers f(a) and g(z)). 


If f is either a real-valued or a complex-valued function on a domain 
S, then we say that f is bounded if there exists a positive number M 
such that |f(x)| < M for alla € S. 


There are two special types of functions of a real or complex variable, the 
even functions and the odd functions. In fact, every function that is defined 
on all of R or C (or, more generally, any function whose domain S' equals 
— §) can be written uniquely as a sum of an even part and an odd part. This 
decomposition of a general function into simpler parts is frequently helpful. 


A function f whose domain S equals —S, is called an even function if 
f(—z) = f(z) for all z in its domain. It is called an odd function if 
f(—z) = —f(z) for all z in its domain. 


We next give the definition for perhaps the most familiar kinds of functions. 


A nonzero polynomial or polynomial function is a complex-valued 
function of a complex variable, p : C' — C, that is defined by a 


formula of the form 
Equation: 


n 
plz) = So axz* = ap tayz+ agz7 +... + an2”, 
k=0 


where the a,;'s are complex numbers and a, # 0. The integer n is 
called the degree of the polynomial p and is denoted by deg(p). The 
numbers Qo, Qj, ..-, @y, are Called the coefficients of the polynomial. 
The domain of a polynomial function is all of C; i.e., p(z) is defined 
for every complex number z. 


For technical reasons of consistency, the identically 0 function is called 
the zero polynomial. All of its coefficients are 0 and its degree is 
defined to be —oo. 


A rational function is a function r that is given by an equation of the 
form r(z) = p(z)/q(z), where q is a nonzero polynomial and p is a 
(possibly zero) polynomial. The domain of a rational function is the set 
S of all z € C for which q(z) £ 0, ie., for which r(z) is defined. 


Two other kinds of functions that are simple and important are step 
functions and polygonal functions. 


Let [a, b] be a closed bounded interval of real numbers. By a partition 
of [a, b] we mean a finite set P = {ay <a, <...< a,}ofn+1 
points, where 79 = aandz,, = b. 


The n intervals {{x;-1, 2;|}, for 1 < i < n, are called the closed 
subintervals of the partition P, and the n intervals {(a;_1, 2;)} are 
called the open subintervals of P. 


We write || P || for the maximum of the numbers (lengths of the 
subintervals) {x; — x;~1}, and call the number || P || the mesh size of 
the partition P. 


A function h : [a,b] + C is called a step function if there exists a 
partition P = {xp < 21 < ... < &p} of [a, b] and m numbers 

{@1, @2, ..-) Qn, } such that h (2) = a; if aj-1 < x < 2j. Thatis, hisa 
step function if it is a constant function on each of the (open) 
subintervals (x;_1, x;) determined by a partition P. Note that the 
values of a step function at the points {x;} of the partition are not 
restricted in any way. 


A function | : [a,b] > R is called a polygonal function, or a piecewise 
linear function, if there exists a partition P = {ap < 41 <... < @n} 
of [a, b] and n + 1 numbers {yo, y1, ---, Yn } such that for each 

x € [x;-1, x;],l(x) is given by the linear equation 

Equation: 


(ey = ye me = 2A); 


where m; = (yi — yi,)/ (@i — 2i_1). That is, 1 is a polygonal function 
if it is a linear function on each of the closed subintervals [x;_1, x;| 
determined by a partition P. Note that the values of a piecewise linear 
function at the points {x;} of the partition P are the same, whether we 
think of x; in the interval |x;_1, x;] or [x;, x;41]. (Check the two 
formulas for [(x;).) 


The graph of a piecewise linear function is the polygonal line joining 
the n + 1 points {(2;, y:)}. 


There is a natural generalization of the notion of a step function that 
works for any domain S, e.g., a rectangle in the plane C’. Thus, if S' is 
a set, we define a partition of S to be a finite collection 

{EF}, E,..., En} of subsets of S for which 


LUE 4h, = §, and 
2. aie, 


Then, a step function on S' would be a function h that is constant on 
each subset &;. We will encounter an even more elaborate generalized 


notion of a step function in Chapter V, but for now we will restrict our 
attention to step functions defined on intervals |a, 5]. 


The set of polynomials and the set of step functions are both closed 
under addition and multiplication, and the set of rational functions is 
closed under addition, multiplication, and division. 


Exercise: 


Problem: 


a. Prove that the sum and product of two polynomials is again a 
polynomial. Show that deg(p + q) <max (deg(p), deg(q)) and 
deg(pq) = deg(p) + deg(q). Show that a constant function is a 
polynomial, and that the degree of a nonzero constant function is 
0. 

b. Show that the set of step functions is closed under addition and 
multiplication. Show also that the maximum and minimum of two 
step functions is again a step function. (Be careful to note that 
different step functions may be determined by different partitions. 
For instance, a partition determining the sum of two step 
functions may be different from the partitions determining the 
two individual step functions.) Note, in fact, that a step function 
can be determined by infinitely many different partitions. Prove 
that the sum, the maximum, and the minimum of two piecewise 
linear functions is again a piecewise linear function. Show by 
example that the product of two piecewise linear functions need 
not be piecewise linear. 

c. Prove that the sum, product, and quotient of two rational 
functions is again a rational function. 

d. Prove the Root Theorem: If p(z) = >>; @x2* is a nonzero 
polynomial of degree n, and if c is a complex number for which 
p(c) = 0, then there exists a nonzero polynomial 


q(2) = am b;zI of degree n — 1 such that p(z) = (z — c)q(z) 
for all z. That is, if c is a “root” of p, then z — c is a factor of p. 
Show also that the leading coefficient b,1 of q equals the leading 
coefficient a, of p. HINT: Write 


s 


Equation: 


. Let f be a function whose domain S equals —S. Define functions 


f. and f, by the formulas 
Equation: 


f(z) + f(—2) 
2 


fle) = (2) 


fe (z) = 9 


and. {5 2). 
Show that f, is an even function, that f, is an odd function, and 
that f = f. + f,. Show also that, if f = g +h, where g is an 
even function and h is an odd function, then g = f, andh = fo. 
That is, there is only one way to write f as the sum of an even 
function and an odd function. 


. Use part (e) to show that a polynomial p is an even function if and 


only if its only nonzero coefficients are even ones, i.e., the a2,'s. 
Show also that a polynomial is an odd function if and only if its 
only nonzero coefficients are odd ones, i.e., the a2441's. 


. Suppose p (z) = >> 7-9 @2%27" is a polynomial that is an even 
function. Show that 
Equation: 
pz) =) C1) axe =n" ©), 
k=0 


where p® is the polynomial obtained from p by alternating the 
signs of its nonzero coefficients. 


If q(z) = peg @on412**"t is a polynomial that is an odd 


function, show that 
Equation: 


where again g® is the polynomial obtained from q by alternating 
the signs of its nonzero coefficients. 

i. If p is any polynomial, show that 
Equation: 


Pp (iz) = De (42) + Po (tz) = pe (2) + tps (2), 


and hence that p, (iz) = p% (z) and p, (iz) = ip? (2). 


Polynomial Functions 

A module about polynomial functions, with a theorem establishing some 
elementary properties of polynomial functions, such as the uniqueness of 
coefficients and the behavior at infinity. 


If p(z) = Dip ae2* and g(z) = oy bj? are two polynomials, it 
certainly seems clear that they determine the same function only if they 
have identical coefficients. This is true, but by no means an obvious fact. 
Also, it seems clear that, as |z| gets larger and larger, a polynomial function 
is more and more comparable to its leading term a,z”. We collect in the 
next theorem some elementary properties of polynomial functions, and in 
particular we verify the above “uniqueness of coefficients” result and the 
“behavior at infinity” result. 


1. Suppose p(z) = 5~7_, ax2* is a nonconstant polynomial of degree 
n > 0. Then p(z) = 0 for at most n distinct complex numbers. 

2. If r is a polynomial for which r(z) = 0 for an infinite number of 
distinct points, then r is the zero polynomial. That is, all of its 
coefficients are 0. 

3. Suppose p and q are nonzero polynomials, and assume that 
p(z) = q(z) for an infinite number of distinct points. Then 
p(z) = q(z) for all z, and p and q have the same coefficients. That is, 
they are the same polynomial. 

4. Let p(z) = ye j=0 c;z) be a polynomial of degree n > 0. Then there 
exist positive constants m and B such that 
Equation: 


|n| 


2 


lz" < |p (z)| < Mz)" 


for all complex numbers z for which |z| > B. That is, For all complex 
numbers z with |z| > B, the numbers |p(z)| and |z|" are 
“comparable.” 

5. If f : [0, 00) > C is defined by f (x) = 4/2, then there is no 
polynomial p for which f(x) = p(z) for all 2 > 0. That is, the square 
root function does not agree with any polynomial function. 


We prove part (1) using an argument by contradiction. Thus, suppose there 
does exist a counterexample to the claim, i.e., a nonzero polynomial p of 
degree n and n + 1 distinct points {c1, C2, ...,Cn41} for which p(c;) = 0 
for all 1 < 7 <n+1. From the set of all such counterexamples, let po be 
one with minimum degree no. That is, the claim in part (1) is true for any 
polynomial whose degree is smaller than ng. We write 

Equation: 


no 
po (z) = > agz*, 
k=0 


and we suppose that po (c;) = 0 for 7 = 1 to mp + 1, where these c;'s are 
distinct complex numbers. We use next the Root Theorem (part (d) of 
[link]) to write po (z) = (z — Cny+1)q (z), where q(z) = D7p%9 bez". We 
have that q is a polynomial of degree np — 1 and the leading coefficient a,,, 
of po equals the leading coefficient 6,1 of g. Note that for 1 < 7 < no we 
have 

Equation: 


0 = po (c;) = (ic; a Cry +1)4 (cj), 


which implies that g(c;) = 0 for 1 < 7 < no, since cj — Cn,41 # O. But, 
since deg (q) < no, the nonzero polynomial q can not be a counterexample 
to part (1), implying that g(z) = 0 for at most ng — 1 distinct points. We 
have arrived at a contradiction, and part (1) is proved. 


Next, let r be a polynomial for which r(z) = 0 for an infinite number of 
distinct points. It follows from part (1) that r cannot be a nonzero 
polynomial, for in that case it would have a degree n > 0 and could be 0 
for at most n distinct points. Hence, r is the zero polynomial, and part (2) is 
proved. 


Now, to see part (3), set r = p — q. Then r is a polynomial for which 
r(z) = 0 for infinitely many z's. By part (2), it follows then that r(z) = 0 
for all z, whence p(z) = q(z) for all z. Moreover, p — q is the zero 


polynomial, all of whose coefficients are 0, and this implies that the 
coefficients for p and q are identical. 


To prove the first inequality in part (4), suppose that |z| > 1, and from the 
backwards triangle inequality, note that 


Equation: 


Ip(z)| 


IV 


IV 


IV 


IV 


2" |S 
@ = gn-k 
n—-1 Ch 
jz" (s 2) T Cn 
n—1 
n Ck 
Iz| eal 2s ee 
n—-1 Icp| 
2-5) 
x0 |2| 
n—1 
Cc 
zl” (len |— tl 
‘20 (2! 
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I" (lena ck |) 
Z| %=0 


Set B equal to the constant (2/|c,, |) ee |c;|. Then, replacing the 1/|z| 
in the preceding calculation by 1/B, we obtain 


Equation: 


Ip (z)| 2 mlz|" 


for every z for which |z| > B. This proves the first half of part (4). 
To get the other half of part (4), suppose again that |z| > 1. We have 


Equation: 


cel |2|"; 


pal<ss 
k=0 


allel’ <2 
k=0 


so that we get the other half of part (4) by setting M = $7), |cg|. 


Finally, to see part (5), suppose that there does exist a polynomial p of 
degree n such that \/z = p(a) for all z > 0. Then x = (p(z))’ forall 

z > 0. Now p’ is a polynomial of degree 2n. By part (2), the two 
polynomials q(x) = x and (p (x))? must be the same, implying that they 
have the same degree. However, the degree of q is 1, which is odd, and the 
degree of p” is 2n, which is even. Hence, we have arrived at a 
contradiction. 

Exercise: 


Problem: 


a. Let r(z) = p(z)/q(z) and r’ (z) = p’ (z)/q’ (z) be two rational 
functions. Suppose r (z) = r’ (z) for infinitely many z's. Prove 
that r (z) = r’ (z) for all z in the intersection of their domains. Is 
it true that p = p’ andq = q’? 

b. Let p and q be polynomials of degree n and m respectively, and 
define a rational function r by r = p/q. Prove that there exist 
positive constants C' and B such that |r (z)| < C|z|"”” for all 
complex numbers z for which |z| > B. 

c. Define f : [0,00) > R by f (x) = +/z. Show that there is no 
rational function r such that f(2) = r(a) for all x > 0. That is, 
the square root function does not agree with a rational function. 

d. Define the real-valued function r on R by r(z) = 1/ (14 2’). 
Prove that there is no polynomial p such that p(x) = r(x) for 
infinitely many real numbers 2. 


e. If f is the real-valued function of a real variable given by 
f(a) = |x|, show that f is not a rational function. HINT: 
Suppose |x| = p(x)/q(x). Then |x|q(x) = p(a) implying that 
|z|q(x) is a polynomial s(a). Now use Theorem 3.1 to conclude 
that p(x) = xq(zx) for all x and that p(x) = —aq(zx) for all x. 

f. Let f be any complex-valued function of a complex variable, and 
let C1, ..., Cn be n distinct complex numbers that belong to the 
domain of f. Show that there does exist a polynomial p of degree 
n such that p(c;) = f (c;) for all 1 < 7 < n. HINT: Describe p 
in factored form. 

g. Give examples to show that the maximum and minimum of two 
polynomials need not be a polynomial or even a rational function. 


Very important is the definition of the compositiong o f of two functions f 
and g. 


Let f : S — T and g: T — U be functions. We define a function 
go f, with domain S and codomain U, by (go f)(x) = g(f(z)). 


Iff:S—>T,g:T > S,andgo f(x) =z forall x € S, then gis 
called a left inverse of f. If f o g(y) = y for all y € T, then g is called 
a right inverse for f. If g is both a left inverse and a right inverse, then 
g is called an inverse for f,f is called invertible, and we denote g by 


7 —1 
Exercise: 


Problem: 


a. Suppose f : S — TJ has a left inverse. Prove that f is 1-1. 

b. Suppose f : S — T has a right inverse. Prove that f is onto. 

c. Show that the composition of two polynomials is a polynomial 
and that the composition of two rational functions is a rational 
function. HINT: If p is a polynomial, show by induction that p” is 
a polynomial. Now use [link]. 


d. Find formulas for g o f and f © g for the following. What are the 
domains of these compositions? 


i f (v) =1+4 2? and g(x) =1/(1+ 2)”. 
ii. f(z) = x/(@ + 1) and g(x) = «/(1— 2). 
iii. f(a) = ax + band g(x) = cx +d. 


Continuity 

Next, we come to the definition of continuity. Unlike the preceding 
discussion, which can be viewed as being related primarily to the algebraic 
properties of functions, this one is an analytic notion. 


Next, we come to the definition of continuity. Unlike the preceding 
discussion, which can be viewed as being related primarily to the algebraic 
properties of functions, this one is an analytic notion. 


Let S and T be sets of complex numbers, and let f : S — T. Then f 
is said to be continuous at a pointc of S if for every positive e€, there 
exists a positive 6 such that if x € S satisfies | — c| < 6, then 
|\f(x) — f(c)| < e. The function f is called continuous on S if it is 
continuous at every point c of S. 


If the domain S of f consists of real numbers, then the function f is 
called right continuous at c if for every € > 0 there exists ad > 0 such 
that | f(a) — f(c)| < € whenever x € S and0 < x —c < 6, andis 
called left continuous at c if for every € > O there exists ad > 0 such 
that | f(a) — f(c)| < € whenever x € Sand0 > x—c> —o. 


REMARK If f is continuous at a point c, then the positive number 6 of the 
preceding definition is not unique (any smaller number would work as 
well), but it does depend both on the number € and on the point c. 
Sometimes we will write 6(€, c) to make this dependence explicit. Later, we 
will introduce a notion of uniform continuity in which 6 only depends on 
the number € and not on the particular point c. 


The next theorem indicates the interaction between the algebraic properties 
of functions and continuity. 


Let S and T be subsets of C’, let f and g be functions from S into 7’, and 
suppose that f and g are both continuous at a point c of S. Then 


1. There exists ad > 0 anda positive number M such that if |y — c| < 6 
and y € S then |f(y)| < M. That is, if f is continuous at c, then it is 
bounded near c. 

2. f +g is continuous at c. 


3. fg is continuous at c. 

4. | f| is continuous at c. 

5. If g(c) 4 0, then f/g is continuous at c. 

6. If f is a complex-valued function, and u and v are the real and 
imaginary parts of f, then f is continuous at c if and only if w and v 
are continuous at c. 


We prove parts (1) and (5), and leave the remaining parts to the exercise 
that follows. 


To see part (1), let € = 1. Then, since f is continuous at c, there exists a 

5 > Osuch that if |y—c| < dandy € S then | f(y) — f(c)| < 1. Since 

|z — w| => ||z| — |w]|| for any two complex numbers z and w (backwards 
Triangle Inequality), it then follows that || f(y)| — |f(c)|| < 1, from which 
it follows that if jy — c| < 6 then |f(y)| < |f(c)| + 1. Hence, setting 

M = |f(c)| +1, we have that if |y — c| < dandy € S, then |f(y)| < M@ 
as desired. 


To prove part (5), we first make use of part 1. Let 61, MM and 62, Mz be 
chosen so that if jy — c| < 6; and y € S then 
Equation: 


If (yl < Mi 


and if |y — c| < 6d, and y € S then 
Equation: 


lg (y)| < Mo 


Next, let e’ be the positive number |g(c)| /2. Then, there exists a 6’ > 0 such 
that if |y—c| < 6’ and y € S then |g(y) — g(c)| < e = |g(c)|/2. It then 
follows from the backwards triangle inequality that 

Equation: 


lg(y)| > €’ = |g (c)|/2s80 that |1/9(y)| < 2/\g(c)| 


Now, to finish the proof of part (5), let € > 0 be given. If 
ly — c| <min (64, 69, 6") and y € S, then from Inequalities (3.1), (3.2), and 
(3.3) we obtain 


Equation: 
fy) _ fO| _ Ifa = fog) 
gy) — gc) la(y)g(e)| 
_ Ifa) = flogle) + floate) — Flog(y)| 
la(y)||9(c)| 
fly) — FIll9lo)| + |FlO|I9(e) — 9(y)| 
=: la(y)||9(c)| 
< (if (uy) — f ()|Ma+ Milg(c) — 9(y)\)x ae 


Finally, using the continuity of both f and g applied to the positive numbers 
€, = c/(4Ms|g(c)/?) and €) = €/(4Mj,|g (c)|”), choose 5 > 0, with 

6 <min (6j, 62, 6’), and such that if |y — c| < d and y € S then 

If(y) — FO) < Gage andig(e) - 9M) < Gao: The if 
ly—c| < dandy € S we have that 


Equation: 
Be f(c) 
cB Le etc 
gy) —_g(c) 

as desired. 

Exercise: 

Problem: 


a. Prove part (2) of the preceding theorem. (It's an €/2 argument.) 

b. Prove part (3) of the preceding theorem. (It's similar to the proof 
of part (5) only easier.) 

c. Prove part (4) of the preceding theorem. 


d. Prove part (6) of the preceding theorem. 

e. Suppose S is a subset of R. Verify the above theorem replacing “ 
continuity” with left continuity and right continuity. 

f. If S is a subset of R, show that f is continuous at a point c € S if 
and only if it is both right continuous and left continuous at c. 


The composition of continuous functions is continuous. 


Let S,7, and U be subsets of C’, and let f: S ~ T andg: T > U be 
functions. Suppose f is continuous at a point c € S and that g is continuous 
at the point f(c) € T. Then the composition g o f is continuous at c. 


Let € > 0 be given. Because g is continuous at the point f(c), there exists 
an a > O such that |g(t) — g(f(c))| < € if |t — f(c)| < a. Now, using this 
positive number a, and using the fact that f is continuous at the point c, 
there exists a 6 > Oso that | f(s) — f(c)| < aif |s — c| < 6. Therefore, if 
|s —c| < 6, then | f(s) — f(c)| < a, and hence 

la(F(s)) — 9(Fle))| = Igo f(s) — 9° F(c)| < € which completes the 
proof. 

Exercise: 


Problem: 


a. If f : C + C is the function defined by f(z) = z, prove that f is 
continuous at each point of C’. 

b. Use part (a) and Theorem 3.2 to conclude that every rational 
function is continuous on its domain. 

c. Prove that a step function h : [a, b] — C is continuous 
everywhere on [a, b] except possibly at the points of the partition 
P that determines h. 


Exercise: 
Problem: 


a. Let S' be the set of nonnegative real numbers, and define 
f:S— S by f (x) = Vz. Prove that f is continuous at each 


point of S'. HINT: For c = 0, use 6 = e*. For c # 0, use the 
identity 
Equation: 


Vytve _ _y-e _yne 

VytVve VytVve” ve 

b. If f : C > R is the function defined by f(z) = |z|, show that f 
is continuous at every point of its domain. 


Vy — ve= (Vy — ve) 


Exercise: 


Problem: 


Using the previous theorems and exercises, explain why the following 
functions f are continuous on their domains. Describe the domains as 
well. 


a. f(z) = (1-2)/(14+ 2). 
b. f (z) = [1+ 24+2? + 2 — (1/z)]. 


c f(z) = 1+ 1 -la. 


Exercise: 


Problem: 


a. If cand d are real numbers, show that 
max (c,d) = (c+ d)/2 + |c— d|/2. 

b. If f and g are functions from S into R, show that 
max (f,g) = (f + 9)/2+|f — g|/2. 

c. If f and g are real-valued functions that are both continuous at a 
point c, show that max (f, g) and min (f, g) are both continuous 
at c. 


Exercise: 


Problem: 


Let N be the set of natural numbers, let P be the set of positive real 
numbers, and define f : N > P by f (n) = V1-+ 7. Prove that f is 
continuous at each point of VV. Show in fact that every function 

f : N > Cis continuous on this domain N. 


HINT: Show that for any € > 0, the choice of 6 = 1 will work. 


Exercise: 
Negations 


Problem: 


a. Negate the statement: “For every € > 0,|x| < e.” 

b. Negate the statement: “For every € > 0, there exists an x for 
which |z| < €.” 

c. Negate the statement that “ f is continuous at c.” 


The next result establishes an equivalence between the basic e, 6 definition 
of continuity and a sequential formulation. In many cases, maybe most, this 
sequential version of continuity is easier to work with than the e, 6 version. 


Let f : S — C’ bea complex-valued function on S, and let c be a point in 
S. Then f is continuous at c if and only if the following condition holds: 
For every sequence {,,} of elements of S that converges to c, the sequence 
{ f (an)} converges to f(c). Or, said a different way, if {x,,} converges to 
c, then { f (x,,)} converges to f(c). And, said yet a third (somewhat less 
precise) way, the function f converts convergent sequences to convergent 
sequences. 


Suppose first that f is continuous at c, and let {x,,} be a sequence of 
elements of S' that converges to c. Let € > 0 be given. We must find a 
natural number N such that ifn > N then |f (x,) — f (c)|< e. First, 
choose 6 > 0 so that |f(y) — f(c)| < € whenever y € S and |y — c| < 6. 
Now, choose WV so that |”, — c| < 6 whenever n > N. Then ifn > N, we 


have that |z, — c| < 6, whence |f (x,) — f (c)|< €. This shows that the 
sequence { f (x,)} converges to f(c), as desired. 


We prove the converse by proving the contrapositive statement; i.e., we will 
show that if f is not continuous at c, then there does exist a sequence {x,,} 
that converges to c but for which the sequence { f (x,)} does not converge 
to f(c). Thus, suppose f is not continuous at c. Then there exists an €g > 0 
such that for every 6 > 0 there is a y € S such that |y — c| < 6 but 

lf (y) — f (c)| = €o. To obtain a sequence, we apply this statement to 6's of 
the form 6 = 1/n. Hence, for every natural number n there exists a point 
Ly © S such that |x, —c| < 1/n but |f (z,,) — f (c)|> eo. Clearly, the 
sequence {x,,} converges to c since |%, — c| < 1/n. On the other hand, the 
sequence { f (x,)} cannot be converging to f(c), because |f (x») — f (c)| 
is always > €. 


This completes the proof of the theorem. 


Continuity and Topology 
This module contains a theorem about the connection between continuity 
and topology, involving open, closed sets and sequential continuity. 


Let f : S — T bea function, and let A be a subset of the codomain T. 
Recall that f~' (A) denotes the subset of the domain S consisting of all 
those x € S for which f(z) € A. 


Our original definition of continuity was in terms of e's and 6's. [link] 
established an equivalent form of continuity, often called “sequential 
continuity,” that involves convergence of sequences. The next result shows 
a connection between continuity and topology, i.e., open and closed sets. 


1. Suppose S is a closed subset of C' and that f : S — Cis a complex- 
valued function on S. Then f is continuous on S$ if and only if 
f* (A) is a closed set whenever A is a closed subset of C’. That is, f 
is continuous on a closed set S if and only if the inverse image of 
every closed set is closed. 

2. Suppose U is an open subset of C’ and that f : U — Cis a complex- 
valued function on U. Then f is continuous on U if and only if 
f* (A) is an open set whenever A is an open subset of C. That is, f 
is continuous on an open set U if and only if the inverse image of 
every open set is open. 


Suppose first that f is continuous on a closed set S and that A is a closed 
subset of C’. We wish to show that f~‘ (A) is closed. Thus, let {2,} be a 
sequence of points in f' (A) that converges to a point c. Because S is a 
closed set, we know that c € S, but in order to see that f+ (A) is closed, 
we need to show that c € f '(A). That is, we need to show that f(c) € A. 
Now, f(z) € A for every n, and, because f is continuous at c, we have by 
[link] that f (c) =lim f (z,). Hence, f(c) is a limit point of A, and so 

f(c) € A because A is a closed set. Therefore, c € f '(A), and f-+(A) 
is closed. 


Conversely, still supposing that S'is a closed set, suppose f is not 
continuous on S, and let c be a point of S at which f fails to be continuous. 
Then, there exists an € > 0 and a sequence {x,,} of elements of S such that 


c =lim z, but such that |f (c) — f (an)|> € for all n. (Why? See the proof 
of Theorem 3.4.) Let A be the complement of the open disk B, (f (c)). 
Then A is a closed subset of C. We have that f(xz,) € A for all n, but f(c) 
is not in A. So, a, € f~'(A) for all n, but c =lim zp is not in f~* (A). 
Hence, f~1(A) does not contain all of its limit points, and so f~1 (A) is 
not closed. Hence, if f is not continuous on S, then there exists a closed set 
A such that f~!(A) is not closed. This completes the proof of the second 
half of part (1). 


Next, suppose U is an open set, and assume that f is continuous on U. Let 
A be an open set in C, and let c be an element of f~' (A). In order to prove 
that f~+ (A) is open, we need to show that c belongs to the interior of 

f-1 (A). Now, f(c) € A,A is open, and so there exists an € > 0 such that 
the entire disk B, (f (c)) C A. Then, because f is continuous at the point c, 
there exists a 6 > O such that if |a — c| < 6 then | f(x) — f(c)| < e.In 
other words, if x € B;(c), then f (x) € B.(f (c)) C A. This means that 
B;(c) is contained in f~' (A), and hence c belongs to the interior of 

f+ (A). Hence, if f is continuous on an open set U, then f~! (A) is open 
whenever A is open. This proves half of part (2). 


Finally, still assuming that U is open, suppose f~' (A) is open whenever A 
is open, let c be a point of S, and let us prove that f is continuous at c. 
Thus, let € > 0 be given, and let A be the open set A = B-.(f (c)). Then, 
by our assumption, f + (A) is an open set. Also, c belongs to this open set 
f+ (A), and hence c belongs to the interior of f~! (A). Therefore, there 
exists a d > O such that the entire disk bs (c) C f~1(A). But this means 
that if € S' satisfies | — c| < 6, then x € B;(c) C f+ (A), and so 

f(x) € A= B.(f (c)). Therefore, if |a — c| < 6, then | f(x) — f(c)| < «, 
which proves that f is continuous at c, and the theorem is completely 
proved. 


Deeper Analytic Properties of Continuous Functions 

We collect here some theorems that show some of the consequences of 
continuity. Some of the theorems apply to functions either of a real variable 
or of a complex variable, while others apply only to functions of a real 
variable. We begin with what may be the most famous such result, and this 
one is about functions of a real variable. 


We collect here some theorems that show some of the consequences of 
continuity. Some of the theorems apply to functions either of a real variable 
or of a complex variable, while others apply only to functions of a real 
variable. We begin with what may be the most famous such result, and this 
one is about functions of a real variable. 

Intermediate Value Theorem 


If f : [a,b] — Risa real-valued function that is continuous at each point of 
the closed interval [a, b], and if v is a number (value) between the numbers 
f(a) and f(b), then there exists a point c between a and b such that 


j(d=% 


If v = f(a) or f(b), we are done. Suppose then, without loss of generality, 
that f(a) < u < f(b). Let S be the set of all x € [a, b] such that f(x) < v, 
and note that S' is nonempty and bounded above. (a € S, and 6 is an upper 
bound for S.) Let c =sup S. Then there exists a sequence {z,,} of 
elements of S that converges to c. (See [link].) So, f (c) =lim f (z,) by 
[link]. Hence, f(c) < v. (Why?) 


Now, arguing by contradiction, if f(c) < v, let € be the positive number 

vu — f(c). Because f is continuous at c, there must exist a d > 0 such that 
|f(y) — f(c)| < € whenever |y — c| < dandy € |a, b]. Since any smaller 6 
satisfies the same condition, we may also assume that 6 < b — c. Consider 
y=c+ 6/2. Then y € [a,b], |y—c| < 6, andso |f(y) — f(c)| <. 
Hence f(y) < f(c) + € = v, which implies that y € S. But, since 

c =sup S,c must satisfy c > y = c+ 6/2. This is a contradiction, so 

f(c) = v, and the theorem is proved. 


The Intermediate Value Theorem tells us something qualitative about the 
range of a continuous function on an interval [a, 6]. It tells us that the range 


is “connected;” i.e., if the range contains two points c and d, then the range 
contains all the points between c and d. It is difficult to think what the 
analogous assertion would be for functions of a complex variable, since 
“between” doesn't mean anything for complex numbers. We will eventually 
prove something called the Open Mapping Theorem in [link] that could be 
regarded as the complex analog of the Intermediate Value Theorem. 


The next theorem is about functions of either a real or a complex variable. 


Let f : S — C’ bea continuous function, and let C' be a compact (closed 
and bounded) subset of S. Then the image f(C) of C is also compact. That 
is, the continuous image of a compact set is compact. 


First, suppose f(C) is not bounded. Thus, let {z,,} be a sequence of 
elements of C’ such that, for each n,| f(x,)| > n. By the Bolzano- 
Weierstrass Theorem, the sequence {z,,} has a convergent subsequence 
{£n,}. Let e =lim z,,. Then x € C' because Cis a closed subset of C’. Co, 
f (z) =lim f (z,,) by [link]. But since | f (x,,,)|> my, the sequence 

{ f (an,) } is not bounded, so cannot be convergent. Hence, we have arrived 
at a contradiction, and the set f(C’) must be bounded. 


Now, we must show that the image f(C) is closed. Thus, let y be a limit 
point of the image f(C) of C, and let y =lim y, where each y,, € f (C). 
For each n, let 2, € C satisfy f (tn) = Yn. Again, using the Bolzano- 
Weierstrass Theorem, let {x,,} be a convergent subsequence of the 
bounded sequence {z,,}, and write z =lim z,,. Then x € C, since C is 
closed, and from [link] 

Equation: 


y =lim f (z,) =lim f (¢n,) = f (x), 


showing that y € f(C), implying that f(C) is closed. 


This theorem tells us something about the range of a continuous function of 
a real or complex variable. It says that if a subset of the domain is closed 
and bounded, so is the image of that subset. 


The next theorem is about continuous real-valued functions of a complex 
variable, and it is one of the theorems to remember. 


Let f be a continuous real-valued function on a compact subset S of C. 
Then f attains both a maximum and a minimum value on S. That is, there 
exist points 21 and z2 in S such that f (z1) < f(z) < f (zo) forall z € S. 


We prove that f attains a maximum value, leaving the fact that f attains a 
minimum value to the exercise that follows. Let Mo be the supremum of the 
set of all numbers f(x) for x € S. (How do we know that this supremum 
exists?) We will show that there exists an z2 € S such that f (z2) = Mo. 
This will finish the proof, since we would then have f (z2) = Mo > f (z) 
for all z € S. Thus, let {y,} be a sequence of elements in the range of f for 
which the sequence {y,,} converges to Mp. (This is Exercise 2.20 again.) 
For each n, let x, be an element of such that y, = f (x,). Then the 
sequence { f (x,)} converges to Mo. Let {x,,} be a convergent 
subsequence of {z,,}. (How?) Let z2 =lim z,,,. Then z, € S, because S is 
closed, and f (z2) =lim f (xp,), because f is continuous. Hence, 

f (22) = Mo, as desired. 

Exercise: 


Problem: 


a. Prove that the f of the preceding theorem attains a minimum 
value on S. 

b. Give an alternate proof of [link] by using [link], and then proving 
that a closed and bounded subset of R contains both its supremum 
and its infimum. 

c. Let S be a compact subset of C’,, and let c be a point of C that is 
not in S. Prove that there is a closest point to c in S. That is, 
show that there exists a point w € S such that |w — c| < |z —c| 
for all points z € S. HINT: The function z > |z — c| is 
continuous on the set S. 


Exercise: 


Problem: 


Let f : [a,b] > R be a real-valued function that is continuous at each 
point of [a, 6}. 


a. Prove that the range of f is a closed interval [a’, b'|. Show by 
example that the four numbers f(a), f(b),a’ and b’ can be 
distinct. 

b. Suppose f is 1-1. Show that, if c is in the open interval (a, b), 
then f(c) is in the open interval (a’, b’). 


We introduce next a different kind of continuity called uniform continuity. 
The difference between regular continuity and uniform continuity is a bit 
subtle, and well worth some thought. 


A function f : S — C’is called uniformly continuous on S if for each 
positive number e, there exists a positive number 6 such that 
\f(z) — f(y)| < € for all x, y € S satisfying |x — y| < 6. 


Basically, the difference between regular continuity and uniform conintuity 
is that the same 6 works for all points in S. 


Here is another theorem worth remembering. 


A continuous complex-valued function on a compact subset S' of C’ is 
uniformly continuous. 


We argue by contradiction. Thus, suppose f is continuous on S' but not 
uniformly continuous. Then, there exists an e > 0 for which no positive 
number 6 satisfies the uniform continuity definition. Therefore, thinking of 
the 6's as ranging through the numbers 1/n, we know that for each positive 
integer n, there exist two points x,, and y, in S so that 


1. |Yn — tn|< 1/n, and 


2. |f (Yn) — fF (#n)2 € 


Otherwise, some 1/n would suffice for a 6. Let {xp} be a convergent 
subsequence of {~,,} with limit x. By (1) and the triangle inequality, we 
deduce that z is also the limit of the corresponding subsequence {yp,} of 
{yn}. But then f (x) =lim f (zn,) =lim f (Yyn,), implying that 

0 =lim |f (Yn,) — f (&n,)|, which implies that | f (y,,) — f (@n,)|< € for 
all large enough k. But that contradicts (2), and this completes the proof. 


Continuous functions whose domains are not compact sets may or may not 
be uniformly continuous, as the next exercise shows. 
Exercise: 


Problem: 


a. Let f : (0,1) — Rbe defined by f(x) = 1/zx. Prove that f is 
continuous at each z in its domain but that f is not uniformly 
continuous there. HINT: Set € = 1, and consider the pairs of 
points x, = 1/n and y, = 1/ (n+ 1). 

b. Let f : [1, co) — [1, co) be defined by f (x) = ./a. Prove that f 
is not bounded, but is nevertheless uniformly continuous on its 
domain. HINT: Take 6 = e. 


Let f : S — T be a continuous 1-1 function from a compact (closed and 
bounded) subset of C’' onto the (compact) set T. Let g : T — S denote the 
inverse function f—! of f. Then g is continuous. The inverse of a 
continuous function, that has a compact domain, is also continuous. 


We prove that g is continuous by using [link]; i.e., we will show that 

g ‘(A) is closed whenever A is a closed subset of C’. But this is easy, since 
g '(A)=g 1} (ANS) = f (ANS), and this is a closed set by [link], 
because AS is compact. See part (e) of [link]. 


REMARK Using the preceding theorem, and the exercise below, we will 
show that taking nth roots is a continuous function. that is, the function f 
defined by f (x) = x/” is continuous. 

Exercise: 


Problem: 


Use the preceding theorem to show the continuity of the following 
functions. 


a. Show that if n is an odd positive integer, then there exists a 
continuous function g defined on all of R such that g(x) is an nth 
root of x for all real numbers z. That is, (g (x))” = for all real 
x. (The function f (x) = x” is 1-1 and continuous.) 

b. Show that if n is any positive integer then there exists a unique 
continuous function g defined on [0, 00) such that g(x) is an nth 
root of x for all nonnegative z. 

c. Let r = p/q be a rational number. Prove that there exists a 
continuous function g : [0, 00) — [0, 00) such that g(x)? = 2? 
for all x > 0; i.e., g(x) = x” for all x > 0. 


Let f be a continuous 1-1 function from the interval [a, b] onto the interval 
[c, d]. Then f must be strictly monotonic, i.e., strictly increasing 
everywhere or strictly decreasing everywhere. 


Since f is 1-1, we clearly have that f(a) 4 f(b), and, without loss of 
generality, let us assume that c = f(a) < f(b) = d. It will suffice to show 
that if @ and 8 belong to the open interval (a, b), anda < £, then 

f(a) < f(6). (Why will this suffice?) Suppose by way of contradiction 
that there exists a < @ in (a,b) for which f(a) > f(8). We use the 
intermediate value theorem to derive a contradiction. Consider the four 
pointsa <<a < 6 <b. Either f(a) < f(a) or f(8) < f(b). (Why?) In the 
first case (f(a) < f(a@)), f({a, a]) contains every value between f(a) and 
f(a). And, f(a, 6]) contains every value between f(a) and f(). So, let v 
be anumber such that f(a) < v,f(8) < v, and v < f(a) (why does such a 
number v exist?). By the Intermediate Value Theorem, there exists 

£1 € (a, qa) such that v = f(21), and there exists an x2 € (a, 8) such that 
v = f(x). But this contradicts the hypothesis that f is 1-1, since 71 # 2. 
A similar argument leads to a contradiction in the second case f(8) < f(b). 


(See the following exercise.) Hence, there can exist no such q@ and £, 
implying that f is strictly increasing on [a, 6]. 
Exercise: 


Problem: 


Derive a contradiction from the assumption that f(3) < f(b) in the 
preceding proof. 


Power Series Functions 

We introduce next a new kind of function. It is a natural generalization of a 
polynomial function. Among these will be the exponential function and the 
trigonometric functions. We begin by discussing functions of a complex 
varible, although totally analogous definitions and theorems hold for 
functions of a real variable. 


The class of functions that we know are continuous includes, among others, 
the polynomials, the rational functions, and the nth root functions. We can 
combine these functions in various ways, e.g., sums, products, quotients, 
and so on. We also can combine continuous functions using composition, so 
that we know that nth roots of rational functions are also continuous. The 
set of all functions obtained in this manner is called the class of “algebraic 
functions.” Now that we also have developed a notion of limit, or infinite 
sum, we can construct other continuous functions. 


We introduce next a new kind of function. It is a natural generalization of a 
polynomial function. Among these will be the exponential function and the 
trigonometric functions. We begin by discussing functions of a complex 
varible, although totally analogous definitions and theorems hold for 
functions of a real variable. 


Let {an}, be a sequence of real or complex numbers. By the power 
series function f (z) = )>°?_) @nz” we mean the function f : S > C 
where the domain S is the set of all z € C’ for which the infinite series 
> anz" converges, and where f is the rule that assigns to such a 

z € S the sum of the series. 


The numbers {a,,} defining a power series function are called the 
coefficients of the function. 


We associate to a power series function f (z) = )>7°.) @nz” its 
sequence {Sy} of partial sums. We write 
Equation: 


N 
Sy (z) = S° G2 
n=0 


Notice that polynomial functions are very special cases of power series 
functions. They are the power series functions for which the 
coefficients {a,, } are all 0 beyond some point. Note also that each 
partial sum Sy for any power series function is itself a polynomial 
function of degree less than or equal to N. Moreover, if f is a power 
series function, then for each z in its domain we have 

f (z) =limy Sy (z). Evidently, every power series function is a 
“limit” of a sequence of polynomials. 


Obviously, the domain S = S'r of a power series function f depends 
on the coefficients {a,,} determining the function. Our first goal is to 
describe this domain. 


Let f be a power series function: f (z) = 07°.) @nz” with domain S. 
Then: 


iL. 
Zs 


Bs 


0 belongs to S. 

If anumber t belongs to S, then every number u, for which |u| < |¢], 
also belongs to S. 

S is a disk of radius r around 0 in C (possibly open, possibly closed, 
possibly neither, possibly infinite). That is, S consists of the disk 

B, (0) = {z: |z| < r} possibly together with some of the points z for 
which |z| = r. 


. The radius r of the disk in part (3) is given by the Cauchy-Hadamard 


formula: 
Equation: 


1 
DL —— 


a 
lim sup|a,|_/” 


which we interpret to imply that r = 0 if and only if the limsup on the 
right is infinite, and r = oo if and only if that limsup is 0. 


Part (1) is clear. 


To see part 2, assume that t belongs to S and that |w| < |t]. We wish to 
show that the infinites series S~ a,u” converges. In fact, we will show that 
S*|anu"| is convergent, i.e., that )* a,,u" is absolutely convergent. We are 
given that the infinite series S* a,t” converges, which implies that the 
terms a,,t” tend to 0. Hence, let B be a number such that |a,z”|< B for all 
n, and set a = |u|/|t|. Then a < 1, and therefore the infinite series 

> Ba” is convergent. Finally, |a,u” |=|ant"|a" < Ba”, which, by the 
Comparison Test, implies that }*|a,w”| is convergent, as desired. 


Part (3) follows, with just a little thought, from part 2. 
To prove part (4), note that lim sup|a,,| 1/” cither is finite or it is infinite. 
assume first that the sequence {|a,,|!/"} is not bounded; i.e., that 


Vim — a, Then, given any number p, there are infinitely many 


lim sup|a,,| 
terms |a,,| 1/ that are larger than p. So, for any z 4 0, there exist infinitely 
many terms |a,,|!/” that are larger than 1/|z|. But then |a,,2”|> 1 for all 
such terms. Therefore the infinite series ye Qnz” is not convergent, since 
lim a,,z” is not zero. So no such z is in the domain S. This shows that if 


lim supla,|'/" = oo, then r = 0 = 1/ lim sup|a,|'"". 


Now, suppose the sequence {|a,,|!/”} is bounded, and let L denote its 
limsup. We must show that 1/r = LD. We will show the following two 
claims: (a) if 1/|z| > L, then z € S, and (b) if 1/|z| < L, thenz ¢ S. 
(Why will these two claims complete the proof?) Thus, suppose that 

1/|z| > L. Let 8 be a number satisfying L < 8 < 1/|z|, and let a = A|z|. 
Then 0 < a < 1. Now there exists a natural number JV so that lan| eSB 
for alln > N, or equivalently |a,|< 6” for alln > N. (See part (a) of 
Exercise 2.17. ) This means that for all mn > N we have 

lan2z” |=|a,/B"||Gz|" < a”. This implies by the Comparison Test that the 
power series 5° a,,2” is absolutely convergent, whence convergent. Hence, 
z € S, and this proves claim (a) above. Incidentally, note also that if 

L = 0, this argument shows that r = oo, as desired. 


To verify claim (b), suppose that 1/|z| < LD. Then there are infinitely many 
terms of the sequence {|a,,|!/"} that are greater than 1/|z|. (Why?) For 


each such term, we would then have |a,,z”|> 1. This means that the infinite 
series )) a,z” is not convergent and z ¢ S, which shows claim b. 


Hence, in all cases, we have that r = 1/ lim sup|a,| 1/n as desired. 


If f is a power series function, the number r of the preceding theorem 
is called the radius of convergence of the power series. The disk S of 
radius r around 0, denoted by B, (0), is called the disk of convergence. 


Exercise: 


Problem: 


Compute directly the radii of convergence for the following power 
series functions, i.e., without using the Cauchy-Hadamard formula. 
Then, when possible, verify that the Cauchy-Hadamard formula agrees 
with your computation. 


Exercise: 


Problem: 


a. Use part (e) of [link] to show that a power series function p is an 
even function if and only if its only nonzero coefficients are even 
ones, i.e., the @»;'s. Show also that a power series function is an 
odd function if and only if its only nonzero coefficients are odd 
ones, i.e., the aoz41'S. 

b. Suppose f (z) = S>7°.9 @2427" is a power series function that is 
an even function. Show that 
Equation: 


fiz) = S- (-1)¥aga™ = f (2), 
k= 


0 


where f® is the power series function obtained from f by 
alternating the signs of its coefficients. We call this function f* 
the alternating version of f. 

c. If g(z) = og Gany127*"! is a power series function that is an 
odd function, show that 
Equation: 


where again g® is the power series function obtained from g by 
alternating the signs of its coefficients. 

d. If f is any power series function, show that 
Equation: 


f (iz) = fe (tz) + fo (tz) = fe (2) + ifs (2), 
and hence that f. (iz) = f% (z) and f, (iz) = ip (z). 


The next theorem will not come as a shock, but its proof is not so simple. 


Let f (z) = >> anz” be a power series function with radius of convergence 
r. Then f is continuous at each point in the open disk B, (0), i-e., at each 
point z for which |z| < r. 


Let z € B, (0) be given. We must make some auxiliary constructions 
before we can show that f is continuous at z. First, choose a z’ such that 
|z| <|z'|< r. Next, set b, = |na,,|, and define g(z) = 5~ b, 2”. By the 
Cauchy-Hadamard formula, we see that the power series function g has the 
same radius of convergence as the power series function f. Indeed, 


1/n 


lim sup|by, | uM" tim sup n lan |=lim n'/" lim sup |a,|. Therefore, 2! 


belongs to the domain of g. Let M be a number such that each partial sum 


of the series g (z’) 


eee b,z'” is bounded by M. 


Now, let « > 0 be given, and choose 6 to be the minimum of the two 
positive numbers e€|z’|/M and |z'|—|z|. We consider any y for which 
ly — z| < 6. Then y € B, (0),|y| <|z’|, and 


Equation: 


f(y) — F(z) 


IA 


IA 


IA 


IA 


This completes the proof. 


Exercise: 


Problem: 


lim|Si (y) — Sw (z)| 
N 
lim| S An (y” — z”) 
n=0 
N 
lim ps an ly _— Zz 
N n—-1 
ea > on lv — 21 3? Ie 2 
n=1 j=0 
N n—-1 ; 
ip 3 foal 21S ef 
n=1 j=0 


ly —2| lim -—- iP 


M 
6 lim —— 
N |z'| 


a. 


b. 


Let f (z) = S072.) anz” be a power series function, and let 

P (2) = reg bez" be a polynomial function. Prove that f + p 
and fp are both power series functions. Express the coefficients 
for f + pand fp in terms of the a,'s and b;'s. 

Suppose f and g are power series functions. Prove that f + gisa 
power series function. What is its radius of convergence? What 
about cf? What about fg? What about f/g? What about | f|? 


Exercise: 


Problem: 


a. 


b. 


Prove that every polynomial is a power series function with 
infinite radius of convergence. 

Prove that 1/z and (1/(z — 1)(z-+ 2)) are not power series 
functions. (Their domains aren't right.) 


. Define f (z) = S>°°, (—1)"z2"!. Prove that the radius of 


n=0 
convergence of this power series function is 1, and that 


f (2) = zr for all z € By (0). Conclude that the rational 
function z/ (1 + z”) agrees with a power series function on the 
disk B, (0). But, they are not the same function. HINT: Use the 


infinite geometric series. 


[link] and [link] and [link] raise a very interesting and subtle point. Suppose 
f (z) = SS ay2z" is a power series function having finite radius of 
convergence r > 0. [link] says that f is continuous on the open disk, but it 

does not say anything about the continuity of f at points on the boundary of 


this disk that are in the domain of f, i,e., at points zo for which |zo|= r. and 


>) anzp% converges. Suppose g(z) is a continuous function whose domain 
contains the open disk B, (0) and also a point zo, and assume that 


f(z) = 


worth some thought to understand just what this question means. It amounts 


g(z) for all z € B, (0). Does f(zo) have to agree with g(zo)? It's 


to a question of the equality of two different kinds of limits. f(zo) is the 
sum of an infinite series, the limit of a sequence of partial sums, while, 


because g is continuous at z9,g(zo =lim,-,,, g (z). At the end of this 
chapter, we include a theorem of Abel that answers this question. 


The next theorem is the analog for power series functions of part (2) of 
[link] for polynomials. We call it the “Identity Theorem,” but it equally well 
could be known as the “Uniqueness of Coefficients Theorem,” for it implies 
that different coefficients mean different functions. 

Identity Theorem 


Let f (z) = >> anz” be a power series function with positive radius of 
convergence r. Suppose {zz} is a sequence of nonzero distinct numbers in 
the domain of f such that: 


1, lim 2, = 0. 
2. f(z) = 0 for all k. 


Then f is identically 0 (f(z) = 0 for all z € S). Moreover, each coefficient 
Gn of f equals 0. 


Arguing by induction on n, let us prove that all the coefficients a, are 0. 
First, since f is continuous at 0, and since lim z, = 0, we have that ao, 
which equals f (0), =lim f (z,) = 0. 


Assume then that a9 = aj =... = an_1 = 0. Then 
Equation: 


(ae rn 


f(z) 


Anz” + An412 


CO : 
_— 4n ol 
= 2 yb 
j=0 


where b; = @n+,- If g is the power series function defined by 
g(z) = 95 6b;2, then, by the Cauchy-Hadamard Formula, we have that the 
radius of convergence for g is the same as that for f. (Why does 

ie 
lim sup|dj| ‘s =lim sup |a,|!/*?) We have that f (z) = 2"g(z) forall z 
in the common disk of convergence of these functions f and g. Since, for 


eachk, zz # Oand f (zx) = 229 (zx) = 0, it follows that g(z,) = 0 for 
every k. Since g is continuous at 0, it then follows as in the argument above 
that g(0) = 0. But, g (0) = bp = ay. Hence a, = 0, and so by induction 
all the coefficients of the power series function f are 0. Clearly this implies 
that f(z) is identically 0. 


Suppose f and g are two power series functions, that {z;} is a sequence of 
nonzero points that converges to 0, and that f (z,) = g (zx) for all &. Then 
f and g have the same coefficients, the same radius of convergence, and 
hence f(z) = 9(z) for all z in their common domain. 

Exercise: 


Problem: 


a. Prove the preceding corollary. (Compare with the proof of [link].) 

b. Use the corollary, and the power series function g(z) = z, to 
prove that f(z) = |z| is not a power series function. 

c. Show that there are power series functions that are not 
polynomial functions. 

d. Let f (z) = 55 ayz” be a power series function with infinite 
radius of convergence, all of whose coefficients are positive. 
Show that there is no rational function r = p/q for which 
f(z) = r(z) for all complex numbers z. Conclude that the 
collection of power series functions provides some new functions. 
HINT: Use the fact that for any n we have that f (2) > a,x” for 
all positive x. Then, by choosing n appropriately, derive a 
contradiction to the resulting fact that |p (2) /q(x)| > anx” for 
all positive x. See part (b) of [link]. 


The Elementary Transcendental Functions 

This module dives deeper into power series functions, examining their 
properties, which coefficients in a power series give special properties, which 
have converging tendencies, and when they are defined everywhere. The 
exponential function is defined. 


Having introduced a class of new functions (power series functions), we 
might well expect that some of these will have interesting and unexpected 
properties. So, which sets of coefficients might give us an exotic new 
function? Unfortunately, at this point in our development, we haven't much 
insight into this question. It is true, see [link], that most power series 
functions that we naturally write down have finite radii of convergence. Such 
functions may well be new and fascinating, but as a first example, we would 
prefer to consider a power series function that is defined everywhere, i.e., one 
with an infinite radius of convergence. Again revisiting [link], let us consider 
the coefficients a,, = 1/n!. This may seem a bit ad hoc, but let's have a look. 


Define a power series function, denoted by exp, as follows: 
Equation: 


We will call this function, with 20-20 hindsight, the exponential 
function. 


What do we know about this function, apart from the fact that it is defined for 
all complex numbers? We certainly do not know that it has anything to do 
with the function e*; that will come in the next chapter. We do know what the 
number e is, but we do not know how to raise that number to a complex 
exponent. 


All of the exponential function's coefficients are positive, and so by part (d) 
of [link] exp is not a rational function; it really is something new. It is natural 
to consider the even and odd parts exp, and exp, of this new function. And 
then, considering the constructions in [link], to introduce the alternating 
versions exp? and exp? of them. 


Define two power series functions cosh (hyperbolic cosine) and sinh 
(hyperbolic sine) by 
Equation: 


exp (z)+ exp (—z) 


cosh (z) = ; 


and sinh (z) = 


and two other power series functions cos (cosine) and sin (sine) by 


Equation: 
exp (2z)+ exp (—2z 
COs (z) =cosh (iz) — exp (ie) exp (2). 
and 
Equation: 
exp (2z)— exp (—iz 
sin (z) = —i sinh (iz) = exp (tz)— exp (~#z) 
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The five functions just defined are called the elementary transcendental 
functions, the sinh and cosh functions are called the basic hyperbolic 
functions, and the sine and cosine functions are called the basic 
trigonometric or circular functions. The connections between the 
hyperbolic functions and hyperbolic geometry, and the connection 
between the trigonometric functions and circles and triangles, will only 
emerge in the next chapter. From the very definitions, however, we can 
see a connection between the hyperbolic functions and the trigonometric 
functions. It's something like interchanging the roles of the real and 
imaginary axes. This is probably worth some more thought. 


Exercise: 


Problem: 


a. Verify the following equations: 
Equation: 


n 


(oe) 
Z 
exp (z) = ee, 
n=0 : 


SS Aa a ee 


= cosh Ocak (2): 


Equation: 
a ae y2kt+l 
si = Sa eg ee : 
i ee 1) Geta! 
7 ee y2k+1 
a (2k +1)!’ 
Equation: 
ye. gay 16 2k 
Os = l-—+—-—4+...4 1 es 
ee) 2 a 6 (1) apr t 
= (1) ’ 
= (2k)! 
Equation: 
Co 2k+1 
sinh (z) = ; 
©) =>. Gera) 
and 
Equation: 
Ws) 3 2k 
cosh (z) = 
4 (2k)! 


(These expressions for the elementary transcendental functions are 
perhaps the more familiar ones from a calculus course.) 


b. Compute the radii of convergence for the elementary 
transcendental functions. HINT: Do not use the Cauchy-Hadamard 
formula. Just figure out for which z's the functions are defined. 

c. Verify that exp (0) = 1,sin (0) =sinh (0) = 0, and 
cos (0) =cosh (0) = 1. 

d. Prove that all five of the elementary transcendental functions are 
not rational functions. 

e. Can you explain why sin? (z)+ cos? (z) = 1? What about the “ 
Addition Formula” 

Equation: 


sin (z + w) =sin (z) cos (w)+ cos (z) sin (w). 


Exercise: 


Problem: 


a. Show that the elementary transcendental functions map real 
numbers to real numbers. That is, as functions of a real variable, 
they are real-valued functions. 

b. Show that the exponential function exp is not bounded above. 
Show in fact that, for each nonnegative integer n,exp (x) /x” is 
unbounded. Can you show that exp (a) = e”? What, in fact, does 
e” mean if z is an irrational or complex number? 


At this point, we probably need a little fanfare! 
THEOREM 3.14159 (Definition of 7) 


There exists a smallest positive number « for which sin (2) = 0. We will 
denote this distinguished number zx by the symbol z. 


First we observe that sin (1) is positive. Indeed, the infinite series for sin (1) 
is alternating. It follows from the alternating series test (Theorem 2.18) that 
sin (1) >1—1/6=5/6. 


Next, again using the alternating series test, we observe that sin (4) < 0. 
Indeed, 
Equation: 

48 45 4? 49 


Hence, by the intermediate value theorem, there must exist a number c 
between 1 and 4 such that sin (c) = 0. So, there is at least one positive 
number x such that sin (2) = 0. However, we must show that there is a 
smallest positive number satisfying this equation. 


Let A be the set of all x > 0 for which sin (a2) = 0. Then A is a nonempty 
set of real numbers that is bounded below. Define 7 =inf A. We need to 
prove that sin (7) = 0, and that 7 > 0. Clearly then it will be the smallest 
positive number « for which sin (x) = 0. 


By [link], there exists a sequence {x;} of elements of A such that 

m7 =lim xx. Since sin is continuous at 7, it follows that 

sin (7) =limsin (x;) =lim 0 = 0. Finally, if 7 were equal to 0, then by the 
Identity Theorem, Theorem 3.14, we would have that sin z = 0 for all x. 
Since this is clearly not the case, we must have that 7 > 0. 


Hence, 7 is the smallest (minimum) positive number z for which 
sin (x) = 0. 


As hinted at earlier, the connection between this number z and circles is not 
at all evident at the moment. For instance, you probably will not be able to 
answer the questions in the next exercise. 

Exercise: 


Problem: 
a. Can you see why sin (x + 27) =sin (x)? That is, is it obvious that 


sin is a periodic function? 
b. Can you prove that cos (7) = —1? 


REMARK Defining 7 to be the smallest positive zero of the sine function 
may strike many people as very much “out of the blue.” However, the zeroes 
of a function are often important numbers. For instance, a zero of the 
function x? — 2 is a square root of 2, and that number we know was 
exztremely important to the Greeks as they began the study of what real 
numbers are. A zero of the function z” + 1 is something whose square is -1, 
i.e., negative. The idea of a square being negative was implausible at first, 
but is fundamental now, so that the zero of this particular function is critical 
for our understanding to numbers. Very likely, then, the zeroes of any “new” 
function will be worth studying. For instance, we will soon see that, perhaps 
disappointingly, there are no zeroes for the exponential function: exp (z) is 
never 0. Maybe it's even more interesting then that there are zeroes of the 
sine function. 


The next theorem establishes some familiar facts about the trigonometric 
functions. 


1. exp (iz) =cos (z) +7 sin (z) forall z € C. 


2. Let {z,} be a sequence of complex numbers that converges to 0. Then 
Equation: 


lim 


3. Let {z,} be a sequence of complex numbers that converges to 0. Then 
Equation: 


1— 1 
i cos (zz) = 
Zz Zz 


Exercise: 


Problem: Prove [link]. 


HINT: For parts (2) and (3), use [link]. 


Analytic Functions and Taylor Series 
A summary of taylor series functions and their properties, including some 
practice exercises relating the taylor series to the identity theorem. 


Let S be a subset of C’, let f : S — C bea complex-valued function, 
and let c be a point of S. Then f is said to be expandable in a Taylor 
series around c with radius of convergence r if there exists anr > 0 
such that B, (c) C S, and f(z) is given by the formula 

Equation: 


for all z € B, (c). 


Let S be a subset of R, let f : S — R bea real-valued function on S, 
and let c be a point of S. Then f is said to be expandable in a Taylor 
series around c with radius of convergence r if there exists anr > 0 
such that the interval (c — r,c +r) C S, and f(z) is given by the 


formula 
Equation: 
CO 
f (2) = S— an(a — c)” 
n=0 


for alla € (c—r,c+r). 


Suppose S is an open subset of C’. A function f : S — Cis called 
analytic on S if it is expandable in a Taylor series around every point c 


of S. 


Suppose S$ is an open subset of R. A function f : S — C is called real 
analytic on S if it is expandable in a Taylor series around every point c 


of S. 


Suppose S is a subset of C’, that f : S — C’is a complex-valued function 
and that c belongs to S. Assume that f is expandable in a Taylor series 


around c with radius of convergence r. Then f is continuous at each 
z€ B,(c). 


Suppose S$ is a subset of R, that f : S > Risa real-valued function and 
that c belongs to S. Assume that f is expandable in a Taylor series around c 
with radius of convergence r. Then f is continuous at each 

xe (c—r,c+r). 


If we let g be the power series function given by g(z) = 4> anz”", and T be 
the function defined by T(z) = z —c, then f(z) = g(T(z)), and this 
theorem is a consequence of [link] and [link]. 

Exercise: 


Problem: Prove that f(z) = 1/z is analytic on its domain. 


HINT: Use r = |c|, and then use the infinite geometric series. 
Exercise: 

Problem: 

State and prove an Identity Theorem, analogous to [link], for functions 

that are expandable in a Taylor series around a point c. 
Exercise: 

Problem: 

a. Prove that every polynomial is expandable in a Taylor series 
around every point c. HINT: Use the binomial theorem. 


b. Is the exponential function expandable in a Taylor series around 
the number —1? 


Uniform Convergence 

We introduce now two different notions of the limit of a sequence of functions. Some definitions cover 
uniform convergence and pointwise convergence. The Weierstrass M-Test is stated and proven, as well the 
theorem of Abel. 


We introduce now two different notions of the limit of a sequence of functions. Let S be a set of complex 
numbers, and let { f,,} be a sequence of complex-valued functions each having domain S. 


We say that the sequence { f,, }converges or converges pointwise to a function f : S > C if for every 
x € Sand every € > 0 there exists a natural number NV, depending on z and ¢, such that for every 
n> N,|\fn (x) — f (x)| < e. That is, equivalently, { f,,} converges pointwise to f if for every x € S 
the sequence { f,, (x)} of numbers converges to the number f(z). 


We say that the sequence { f,, }converges uniformly to a function f if for every € > 0, there exists an 
N, depending only on e, such that for every n > N andevery x € S,|f, (x) — f (x)| <. 


If {u,} is a sequence of functions defined on S, we say that the infinite series )> unconverges 


uniformly if the sequence {s N= Sy un} of partial sums converges uniformly. 


These two definitions of convergence of a sequence of functions differ in subtle ways. Study the word order 
in the definitions. 
Exercise: 


Problem: 


a. Prove that if a sequence {f,,} of functions converges uniformly on a set S to a function f then it 
converges pointwise to f. 

b. Let S = (0,1), and for each n define f, (x) = x”. Prove that { f,,} converges pointwise to the 

zero function, but that { f,, } does not converge uniformly to the zero function. Conclude that 

pointwise convergence does not imply uniform convergence. HINT: Suppose the sequence does 

converge uniformly. Take « = 1/2, let N be a corresponding integer, and consider z's of the form 

x =1-—hA for tiny h's. 

Suppose the sequence { f,, } converges uniformly to f on S, and the sequence {g,,} converges 

uniformly to g on S. Prove that the sequence {f,, + g,} converges uniformly to f + gon S. 

d. Suppose { f,, } converges uniformly to f on S, and let c be a constant. Show that {cf,, } converges 
uniformly to cf on S. 

e. Let S = R, and set fn (v) = « + (1/n). Does {f,} converge uniformly on S? Does { f?} 
converge uniformly on S? What does this say about the limit of a product of uniformly 
convergent sequences versus the product of the limits? 


. Suppose a and b are nonnegative real numbers and that |a — b| < e”. Prove that |va — Vo|< 2€. 


HINT: Break this into cases, the first one being when both 4/a and Vb are less than e. 

g. Suppose { f,, } is a sequence of nonnegative real-valued functions that converges uniformly to f 
on S. Use part (f) to prove that the sequence {Vv fu converges uniformly to a Te 

1+1/n 


ie) 


loa) 


h. For each positive integer n, define f,, on (—1,1) by fp (x) = |z| . Prove that the sequence 
{f,} converges uniformly on (—1, 1) to the function f(x) = |x|. HINT: Let € > 0 be given. 
Consider |z|'s that are < € and |z|'s that are > e. For |x| < e, show that |f, (x) — f (x)| < € for 
all n. For |x| > €, choose N so that jet/” —1| < e. How? 


Exercise: 


Problem: 


Let {fn} be a sequence of functions on a set S, let f be a function on S, and suppose that for each n 
we have |f (x)—fn (x)| < 1/n for all x € S. Prove that the sequence { f;, } converges uniformly to f. 


We give next four important theorems concerning uniform convergence. The first of these theorems is 
frequently used to prove that a given function is continuous. The theorem asserts that if f is the uniform 
limit of a sequence of continuous functions, then f is itself continuous. 

The uniform limit of continuous functions is continuous. 


Suppose { f,, } is a sequence of continuous functions on a set S C C, and assume that the sequence { f,, } 
converges uniformly to a function f. Then f is continuous on S. 


This proof is an example of what is called by mathematicians a “3e argument.” 


Fix ana € Sand ane > 0. We wish to finda dé > 0 such that if y € S and |y — x| < 6 then 
If(y) — fla)| <e. 


We use first the hypothesis that the sequence converges uniformly. Thus, given this « > 0, there exists a 
natural number N such that ifm > N then | f (z)—f, (z)| < €/3 for all z € S. Now, because fy is 
continuous at x, there exists a d > 0 such that if y € S and |y — z| < 6 then | fw (y) — fx (@)| < €/3. So, 
if y € S and |y — 2| < 6, then 


Equation: 
f(y) — flx)| = If (y)—fn (y) + fy (y) — fr (2) + fr (2) — Ff (2)| 
< |f(y)—fy (y)i+1 fy (y) — fv (@)|+1 Fy (2) — F (2) 
ee 


This completes the proof. 


REMARK Many properties of functions are preserved under the taking of uniform limits, e.g., continuity, 
as we have just seen. However, not all properties are preserved under this limit process. Differentiability is 
not, integrability is sometimes, being a power series function is, and so on. We must be alert to be aware of 
when it works and when it does not. 

Weierstrass M-Test 


Let {un} be a sequence of complex-valued functions defined on a set S C C. Write Sy for the partial sum 
Sule) = ya Un (a). Suppose that, for each n, there exists an M,, > 0 for which |u,, (x)| < My, for all 
xz € S. Then 


1. If > M,, converges, then the sequence {.S;y} converges uniformly to a function S. That is, the infinite 
series 5> uy, converges uniformly. 
2. If each function u, is continuous, and }> M,, converges, then the function S of part (1) is continuous. 


Because > M,, is convergent, it follows from the Comparison Test that for each x € S the infinite series 
rg Un (x) is absolutely convergent, hence convergent. Define a function S by 


52) =>) ,-¢ Vile) Shin Sy (2): 


To show that {Sv} converges uniformly to S, let € > 0 be given, and choose a natural number N such that 
peter M,, < €. This can be done because 5) M,, converges. Now, for any « € S and any m > N, we 
have 


Equation: 
|S (#)—Sm (a)| = [im Si (2) — Sm (2)| 
= |jim (St (2) ~ Sn (2)) 
= fim, |S: (2) ~ Sw (2)| 
k 
k 
< dm Zt 
k 
< lim M,, 
k-00 Reser 
= S> M,, 
n=m+1 
< SoM, 
n=N 
< € 
This proves part (1). 


Part (2) now follows from part (1) and [link], since the S'jy's are continuous. 


Let f (z) = 072.9 @nz” be a power series function with radius of convergence r > 0, and let {Si (z)} 
denote the sequence of partial sums of this series: 
Equation: 


N 
Siz) = yan 
n=0 


If 0 < r’ <r, then the sequence {Sy} converges uniformly to f on the diskB,, (0). 


Define a power series function g by g(z) = 072-9 |an|z”, and note that the radius of convergence for g is 


the same as that for f, i.e., r. Choose ¢ so that r’ < t < r. Then, since t belongs to the disk of convergence 
of the power series function g, we know that }>”° , |an|t” converges. Set my = |a,|t”, and note that 


>> m,, converges. Now, for each z € B, (0), we have that 
Equation: 


lanz” Slane = lage? =the 


so that the infinite series }> a,,z" converges uniformly on B,. (0) by the Weierstrass M-Test. 


Exercise: 


Problem: 


Let f (z) = O72.) 2”. Recall that the radius of convergence for f is 1. Verify that the sequence {Sv} 
of partial sums of this power series function fails to converge uniformly on the full open disk of 
convergence B, (0), so that the requirement that r’ < r is necessary in the preceding theorem. 


The next theorem shows that continuous, real-valued functions on closed bounded intervals are uniform 
limits of step functions. Step functions have not been mentioned lately, since they aren't continuous 
functions, but this next theorem will be crucial for us when we study integration in [link]. 


Let f be a continuous real-valued function on the closed and bounded interval [a, 6]. Then there exists a 
sequence {h,,} of step functions on |a, b] that converges uniformly to f. 


We use the fact that a continuous function on a compact set is uniformly continuous ([link]). 


For each positive integer n, let 6, be a positive number satisfying | f(x) — f(y)| < 1/nif |x — y| < by. 
Such a 6, exists by the uniform continuity of f on [a,b]. Let P, = {xo < 41 < ... < &@m,} bea partition 
of {a, b| for which x; — 24-1 < dp for all 1 < i < mp. Define a step function h,, on |[a, 6] as follows: 


If x;_1 <a < aj, thenh, (x) = f (x,_1). This defines h,, (x) for every x € [a, b), and we complete the 
definition of h,, by setting h,, (b) = f (6). It follows immediately that h,, is a step function. 


Now, we claim that | f (x)—hn (x)| < 1/n for all x € [a, b]. This is clearly the case for x = b, since 
f (b) = h, (6) for all n. For any other a, let 7 be the unique index such that #;_1 < # < x;. Then 
Equation: 


| f (t)—hn (x)| = If (x) — f (@i-1)|< 1/n 


because |x—2j_1|< dn. 


So, we have defined a sequence {h,,} of step functions, and the sequence {h,,} converges uniformly to f 
by [link]. 


We close this chapter with a famous theorem of Abel concerning the behavior of a power series function on 
the boundary of its disk of convergence. See the comments following [link]. 
Abel 


Suppose f (z) = }3°° 9 anz” is a power series function having finite radius of convergence r > 0, and 
suppose there exists a point zo on the boundary of B, (0) that is in the domain of f; i.e., }) a,zg converges 
to f(zo). Suppose g is a continuous function whose domain contains the open disk B,. (0) as well as the 
point zo, and assume that f(z) = g(z) for all z in the open disk B, (0). Then f(zo) must equal g(zo). 


For simplicity, assume that r = 1 and that zp = 1. See the exercise that follows this proof. Write S;, for the 
partial sum of the a,'s: S$, = . 3 Gn. In the following computation, we will use the Abel Summation 
Formula in the form 

Equation: 


See [link]. Let € be a positive number. Then, for any 0 < ¢ < 1 and any positive integer N, we have 
Equation: 


N N 
l9(1)- FO) = IgG) —- fF) + f()- dant” + > ant” - 


< |9Q)—-g(t)| + | FC J Sant" 1S at = F() 
N 
< |g) —g(t)|+1FQ- do ant 
N-1 - 
+|Svt" + 5° S, (&" — t"*1) — f (1) 
n=0 
= |91)-g9@1+1F J Sant 
N-1 N-1 
+|Snt® + $7 (Sa — Sw) (t® — #7") + Sw SS (te - t"7) - FD) 
n=0 n=0 
= |I9Q)-g9(@)|+1fC Jat 
N-1 
+| 55 (Sn — Sw) (t? — #"**) + Sw (eS ve a) } =f (l) 
n=0 
< |9Q)-g(t)| + |F Ce Jat 
N-1 
+| > (Sn — Sw) (#" — #"*") [41S — f (1) 
n=0 
< |9Q)-g9(t)| + (FC Joan 
P N-1 
+| 55 (Sn — Sw) (€” — #7) [+] D2 (Sn — Sw) (e" — #77) [+1Sy — F (1) 
n=0 n=P+1 
< |9Q)-g(t)| + |F )- ant 
N-1 
+130 (Sn — Sw) (e7—e")/+ S015, — Syl (t? — t"**) + [Sw — Ff (1)| 
n=0 n=P+1 


ti + to +t3 +t4 +15. 


First, choose an integer Mj so that if P and N are both larger than My, then t4 < e. (The sequence {.S;} is 
a Cauchy sequence, and ie — t**! is telescoping.) 


Fix such a P > Mj. Then choose ad > Oso that if 1 > ¢ > 1 — 6, then both ¢; and t3 < e. How? 


Fix such a ¢. Finally, choose a N, greater than Mj, and also large enough so that both fg and ts are less 
than e. (How?) 


Now, |g(1) — f(1)| < 5e. Since this is true for every € > 0, it follows that f(1) = g(1), and the theorem is 
proved. 
Exercise: 


Problem: 


Let f,g,7, and 29 be as in the statement of the preceding theorem. Define f(z) = f (zz) and 
9(z) = 9 (202). 


a. Prove that f is a power series function f (z) = S77?) 6nz”, with radius of convergence equal to 


1, and such that }>°° 9 6, converges to f (1); ie., 1 is in the domain of f. 

b. Show that g is a continuous function whose domain contains the open disk B, (0) and the point 
=A; 

c. Show that, if f (1) = g(1), then f (zo) = g (zo). Deduce that the simplification in the preceding 
proof is justified. 

d. State and prove the generalization of Abel's Theorem to a function f that is expandable in a 
Taylor series around a point c. 


Differentiation, Local Behavior Ein = -1. 
This is an overview of important topics on differentiation, including links to 
theorems from the same author. 


In this chapter we will finally see why e’” is Along the way, we will 
give careful proofs of all the standard theorems of Differential Calculus, 
and in the process we will discover all the familiar facts about the 
trigonometric and exponential functions. At this point, we only know their 
definitions as power series functions. The fact that or that 
e” ¥  e*e¥ are not at all obvious. In fact, we haven't even yet defined what 
is meant by e” for an arbitrary number x 


The main theorems of this chapter include: 


1. The Chain Rule ((link]), 

2. The Mean Value Theorem ([link]), 

3. The Inverse Function Theorem ([link]), 

4. The Laws of Exponents (({link] and [link]), and 
5. Taylor's Remainder Theorem ((link]). 


The Limit of a Function 

The concept of the derivative of a function is what most people think of as 
the beginning of calculus. However, before we can even define the 
derivative we must introduce a kind of generalization of the notion of 
continuity. That is, we must begin with the definition of the limit of a 
function. 


The concept of the derivative of a function is what most people think of as 
the beginning of calculus. However, before we can even define the 
derivative we must introduce a kind of generalization of the notion of 
continuity. That is, we must begin with the definition of the limit of a 
function. 


Let f : S — C’bea function, where S C C, and let c be a limit point 
of S that is not necessarily an element of S. We say that fhas a limit L 
as z approaches c, and we write 
Equation: 

lim f (z) = L, 


ZC 


if for every € > 0 there exists a 6 > 0 such that if z € S and 
0 < |z—c| <6, then |f(z) — L| <e. 


If the domain S is unbounded, we say that f has a limit L as z 
approachesox, and we write 
Equation: 

L=lim f(z), 


Z—7 00 


if for every € > 0 there exists a positive number B such that if z € S$ 
and |z| > B, then |f(z) — L| < . 


Analogously, if S C R, we say lim,_,. f (x) = Lif for every e > 0 
there exists a real number B such that if 2 € S and x > B, then 
|\f(x) — L| < e. And we say that lim,_,_. f (x) = L if for every 


€ > 0 there exists a real number B such that if 2 € S and x < B, then 
\f(z) — L| <e. 


Finally, for f : (a,b) — Ca function of a real variable, and for 

c € |a, b], we define the one-sided (left and right) limits of f at c. We 
say that f has a left hand limit of L at c, and we write 

L =lim,-,--o f (a), if for every € > 0 there exists ad > 0 such that if 
x € (a,b) and0 <c— <2 < 6 then |f(x) — L| < e. We say that f has 
a right hand limit of L at c, and write L =lim,_,.;0 f (x), if for every 
€ > O there exists ad > 0 such that if € S and0 < x —c < éthen 
|f(z) — L| <e. 


The first few results about limits of functions are not surprising. The 
analogy between functions having limits and functions being continuous is 
very close, so that for every elementary result about continuous functions 
there will be a companion result about limits of functions. 


Let c be a complex number. Let f : S — Cand g: S — C'be functions. 
Assume that both fand g have limits as x approaches c. Then: 


1. There exists ad > 0 and a positive number M such that if z € S and 
0 < |z—c| < 6 then |f(z)| < M. That is, if f has a limit as z 
approaches c, then f is bounded near c. 

2. Equation: 


lim (f (z) + 9(z)) =lim f (z)+ lim g(2). 


Z—>C 
3. Equation: 


lim (f (z)g(z)) =lim f(z) lim g (z). 


2Z—>C Z—>C ZC 


4. If lim, ,. g(z) 4 0, then 
Equation: 


a f(z) 7 limp -v< f (2) 
ze g(z) lim;-s¢ 9 (Z) 


) 


5. If u and v are the real and imaginary parts of a complex-valued 
function f, then u and v have limits as z approaches c if and only if f 
has a limit as z approaches c. And, 


Equation: 
lim f(z) =lim u (z) +i lim v (2). 
Exercise: 
Problem: 


a. Prove [link]. HINT: Compare with [link]. 

b. Prove that lim,_,. f (x) = L if and only if, for every sequence 
{x,} of elements of S that converges to c, we have 
lim f(x,) = DL. HINT: Compare with [link]. 

c. Prove the analog of [link] replacing the limit as z approaches c by 
the limit as z approaches oo. 


Exercise: 


Problem: 


a. Prove that a function f : S — C is continuous at a point c of S if 
and only if lim,_,. f (x) = f (c). HINT: Carefully write down 
both definitions, and observe that they are verbetim the same. 

b. Let f be a function with domain S, and let c be a limit point of S 
that is not in S. Suppose g is a function with domain S$ U {c}, 
that f(a) = g(x) for all x € S, and that g is continuous at c. 
Prove that lim,-,. f (x) = g(c). 


Exercise: 


Problem: 


Prove that the following functions f have the specified limits L at the 
given points c. 


a. f (a) = (2 — 8)/ (2? — 4), c= 2,andL =3 
bf (eo) = (1/2? 41), €=1, and D1 
c. f(z) = (a8 —1)/ (#® +1), ¢ =4, and L = —4/3. 
d. f (x) = (sin («)+ cos (z)— exp (x))/ (a?) ,¢ = 0, and 
£L=-1. 
Exercise: 
Problem: 


Define f on the set S of all nonzero real numbers by f(x) = c if 
x < Oand f(x) = dif zx > 0. Show that lim, ,o f (x) exists if and 
only if c = d. 


(b) Let f : (a,b) — C be a complex-valued function on the open 
interval (a, b). Suppose c is a point of (a, b). Prove that lim,_,. f (x) 
exists if and only if the two one-sided limits lim,_,.9 f (x) and 
lim,_5c+0 f (x) exist and are equal. 


Exercise: 
Change of variable in a limit 


Problem: 


Suppose f : S — Cis a function, and that lim,_,. f (x) = D. Define 
a function g by g(y) = f(y+c). 


a. What is the domain of g? 

b. Show that 0 is a limit point of the domain of g and that 
limy_+0 g (y) =lim, ¢ f (2): 

c. Suppose T C C, that h : T — S, and that lim,.g h (y) =. 
Prove that 
Equation: 


lim f (h(y)) =lim f (x) = L. 


yd ed 


REMARK When we use the word “ interior” in connection with a set S, it 
is obviously important to understand the context; i.e., is S being thought of 
as a set of real numbers or as a set of complex numbers. A point c is in the 
interior of a set S of complex numbers if the entire disk B, (c) of radius € 
around c is contained in S. While, a point c belongs to the interior of a set S 
of real numbers if the entire interval (c — €,c + €) is contained in S. 

Hence, in the following definition, we will be careful to distinguish between 
the cases that f is a function of a real variable or is a function of a complex 
variable. 


The Derivative of a Function 

Now begins what is ordinarily thought of as the first main subject of calculus, the 
derivative. A definition of differentiable and some important theorems concerning 
derivatives, such as the chain rule, are included. 


Now begins what is ordinarily thought of as the first main subject of calculus, the 
derivative. 


Let S be a subset of R, let f : S — C’ be a complex-valued function (of a real 
variable), and let c be an element of the interior of S. We say that f is differentiable 
at c if 

Equation: 


exists. (Here, the number A is a real number.) 


Analogously, let S' be a subset of C, let f : S — C be a complex-valued function (of 
a complex variable), and let c be an element of the interior of S. We say that f is 
differentiable at c if 

Equation: 


exists. (Here, the number h is a complex number.) 


If f : S + Cis a function either of a real variable or a complex variable, and if S’ 
denotes the subset of S consisting of the points c where f is differentiable, we define 
a function f’: S’ > C by 

Equation: 


The function f’ is called the derivative of f. 


A continuous function f : [a,b] — C that is differentiable at each point x € (a, d), 
and whose the derivative f’ is continuous on (a, b), is called a smooth function on 

[a, b]. If there exists a partition {a = rp < 41 < ... < x, = 5b} of [a, B] such that f 
is smooth on each subinterval [x;_1, x;|, then f is called piecewise smooth on |a, 5]. 


Higher order derivatives are defined inductively. That is, f’” is the derivative of f’, 
and so on. We use the symbol f(”) for the nth derivative of f. 


REMARK In the definition of the derivative of a function f, we are interested in the 
limit, as h approaches 0, not of f but of the quotient g(h) = a! . Notice that 0 is 
not in the domain of the function q, but 0 is a limit point of that domain. This is the reason 
why we had to make such a big deal above out of the limit of a function. The function gq is 
often called the differential quotient. 


REMARK As mentioned in [link], we are often interested in solving for unknowns that 
are functions. The most common such problem is to solve a differential equation. In such 
a problem, there is an unknown function for which there is some kind of relationship 
between it and its derivatives. Differential equations can be extremely complicated, and 
many are unsolvable. However, we will have to consider certain relatively simple ones in 


this chapter, e.g., f’ = f,f’ = —f, and f” = +f. 


There are various equivalent ways to formulate the definition of differentiable, and each 
of these ways has its advantages. The next theorem presents one of those alternative ways. 


Let c belong to the interior of a set S (either in R or in C), and let f: S > Cbea 
function. Then the following are equivalent. 


1. f is differentiable at c. That is, 


Equation: 
lim f(e+h) ~ flo) exists. 
h-0 h 
2. Equation: 
lim TEVA HO) exists. 
zZ—-C r—C 


3. There exists a number L and a function 6 such that the following two conditions 
hold: 
Equation: 


f(c +h) — f(c) = Lh + 0(h) 


and 
Equation: 


In this case, L is unique and equals f’ (c), and the function 6 is unique and equals 


fle+h)— fle) — fi (e)h. 
That (1) and (2) are equivalent follows from [link] by writing x asc + h. 


Suppose next that f is differentiable at c, and define 


Equation: 

L= f'(c) =jm Het" 9) 
Set 
Equation: 

0(h) = f(ct+h)—fle)— fi (oh. 
Then clearly 
Equation: 


f(c+h) — f(c) = Lh+ Oh), 


which is [link]. Also 


Equation: 
aa _ fer He fie 
= AE - 100, 


which tends to 0 as h approaches 0 because f is differentiable at c. Hence, we have 
established [link] and [link], showing that (1) implies (3). 


Finally, suppose there is a number ZL and a function @ satisfying [link] and [link]. Then 
Equation: 


f(e+h) — fle) 


6(h) 
h h 


32, 


which converges to L as h approaches 0 by [link] and part (2) of [link]. Hence, 
L = f' (c), andso 0(h) = f (c +h) — f (c) — f' (c)h. Therefore, (3) implies (1), and 


the theorem is proved. 


REMARK Though it seems artificial and awkward, Condition (3) of this theorem is very 
convenient for many proofs. One should remember it. 
Exercise: 


Problem: 


a. What is the domain of the function 6 of condition (3) in the preceding theorem? 
Is 0 in this domain? Are there any points in the interior of this domain? 

b. Let Z and @ be as in part (3) of the preceding theorem. Prove that, given an 
€ > 0 there exists a 6 > 0 such that if |h| < 6 then |O(h)| < eh]. 


If f : S + Cis a function, either of a real variable or a complex variable, and if f is 
differentiable at a point c of S, then f is continuous at c. That is, differentiability implies 
continuity. 


We are assuming that lim;,_,9 (f (c + h) — f (c))/h = L. Hence, there exists a positive 
number do such that Se A Aa fo) — L| < 1if|h| < do, implying that 

lf(e +h) — f(c)| < AIL + 1)whenever |h| < do. So, if € > 0 is given, let 6 be the 
minimum of do and €/(|Z| + 1). If y € S and |y — c| < 4, then, thinking of y as being 
c+h, 

Equation: 


If(y) — Fl) = |Fle + A) — Fle) < [ANIL] + 2) = ly — lJ] + 1) <e 


(Every y can be written as c + h for some h, and |y — c| = |A].) 
Exercise: 


Problem: Define f(z) = |z| for z € C. 


a. Prove that f is continuous at every point of C’. 

b. Show that, if f is differentiable at a point c, then f’ (c) = 0. HINT: Using part 
(b) of [link], evaluate f’ (c) in the following two ways. 
Equation: 


is eae L 
and 
Equation: 


lc+7;|—le| 

/ s n 

i 
Show that the only way these two limits can be equal is for them to be 0. 

c. Conclude that f is not differentiable anywhere. Indeed, if it were, what would 
the function @ have to be, and why wouldn't it satisfy [link]? 

d. Suppose f : R > Ris the function of a real variable that is defined by 
f(x) = |x|. Show that f is differentiable at every point « # 0. How does this 
result not contradict part (c)? 


The following theorem generalizes the preceding exercise. 


Suppose f : S — Risa real-valued function of a complex variable, and assume that f is 
differentiable at a point c € S. Then f’ (c) = 0. That is, every real-valued, differentiable 
function f of a complex variable satisfies f’ (c) = 0 for all c in the domain of f’. 


We compute f’ (c) in two ways. 


Equation: 
c++)- 
f' (c) =lim ( n) J is a real number.. 
Equation: 
ee i Gas ee ee 
f' (c) =lim ——————— is a purely imaginary number. 


Hence, f’ (c) must be 0, as claimed. 


REMARK This theorem may come as a surprise, for it shows that there are very few 
real-valued differentiable functions of a complex variable. For this reason, whenever 

f :S — Risa real-valued, differentiable function, we will presume that f is a function 
of a real variable; i.e., that the domain S C R. 


Evaluating limp_,9 q (A) in the two different ways, A real, and h pure imaginary, led to 
the proof of the last theorem. It also leads us to make definitions of what are called 
“partial derivatives” of real-valued functions whose domains are subsets of C = R?. As 
the next exercise will show, the theory of partial derivatives of real-valued functions is a 
much richer theory than that of standard derivatives of real-valued functions of a single 
complex variable. 


Let f : S + Rbe defined ona set S C C = R?, andletc = (a,b) = ++bibea 
point in the interior of S. We define the partial derivative of f with respect to x at the 
point c = (a, b) by the formula 

Equation: 


and the partial derivative of f with respect to y at c = (a, b) by the formula 
Equation: 


(a ) a ee f(a,b+ h) ZZ f(a, b) 
tialy*’ ’  h-0 h , 


whenever these limits exist. (In both these limits, the variable h is a real variable. )( 


It is clear that the partial derivatives of a function arise when we fix either the real 
part of the variable or the imaginary part of the variable to be a constant, and then 
consider the resulting function of the other (real) variable. We will see in [link] that 
there is a definite difference between a function's being differentiable at a point 

c = (a + bi) in the complex plane C versus its having partial derivatives at the point 


(a, b) in R?. 


Exercise: 


Problem: 


a. Suppose f is a complex-valued function of a complex variable, and assume that 
both the real and imaginary parts of f are differentiable at a point c. Show that 
f is differentiable at c and that f’ (c) = 0. 

b. Let f = u-+ iv be a complex-valued function of a complex variable that is 
differentiable at a point c. Prove that both partial derivatives of u and v exist at 
c = (a,b), and in fact that 


Equation: 
ee ie 
c) +4 =] (e 
tialx tialx 
and 
Equation: 
tialu _ tialv 


c. Define a complex-valued function f on C = R? by 
f(z) = f(x + ty) = x — ty. Write f = u + iv, and show that both partial 
derivatives of u and v exist at every point, but that f is not a differentiable 
function of the complex variable z. 


The next theorem is, in part, what we call in calculus the “differentiation formulas.” 


Let f and g be functions (either of a real variable or a complex variable), which are both 
differentiable at a point c. Let a and b be complex numbers. Then: 


1. af + bg is differentiable at c, and (af + bg)’ (c) = af! (c) + bg! (c). 
2. (Product Formula) fg is differentiable at c, and 
(f9)' (c) = f' (eg (e) + f (0)! (©). 
3. (Quotient Formula) f/g is differentiable at c (providing that g(c) 0), and 
Equation: 
P\ ic Or O-fioe © 
=| C=. 
(9 (c)) 


9 
4. If f = w+ iv is a complex-valued function, then f is differentiable at a point c if 
and only if u and v are differentiable at c, and f’ (c) = u’ (c) + iv’ (c). 


We prove part (2) and leave parts (1), (3), and (4) for the exercises. We have 
Equation: 


(fg9)(c + h) — (f9)(c) 


7 a ae ; 
= tn fle+h)gle+h) — fle)gle +h) 
h-0 h 
Lt JOE = FOgo) 
h-0 h 
= lim few?) f(c) Tee 
; _. g(e+h) — g(c) 

eA 

= f'(c)g(e) + f(e)g' (e); 


where we have used [link], [link], and [link]. 
Exercise: 


Problem: 


a. Prove parts (1), (3), and (4) of [Link]. 

b. If f and g are real-valued functions that are differentiable at a point c, what can 
be said about the differentiability of max (f, g)? 

c. Let f be a constant function f(z) = k. Prove that f is differentiable everywhere 
and that f’ (z) = 0 for all z. 

d. Define a function f by f(z) = z. Prove that f is differentiable everywhere and 
that f’ (z) = 1 for all z. 

e. Verify the usual derivative formulas for polynomial functions: If 


p(z) = pe, a,2", then p’ Z= re kapz*?. 


What about power series functions? Are they differentiable functions? If so, are their 
derivatives again power series functions? In fact, everything works as expected. 


Let f be a power series function f (z) = S>°°_) anz” having radius of convergence 
r > 0. Then f is differentiable at each point z in its open disk B, (0) of convergence, and 
Equation: 


[o) o.) 
ie ae = ae. 
n=0 n=1 


The proof will use part (3) of [link]. Fix an z with |z| < r. Choose r’ so that |z| <r’ <r, 
and write a for r’ — |z|, ie., |z| + a =r’. Note first that the infinite series \-"° 9 |an|r’” 
converges to a positive number we will call M. Also, from the Cauchy-Hadamard 
Formula, we know that the power series function }> na,,w” has the same radius of 
convergence as does f, and hence the infinite series }* na,z"~‘ converges to a number 
we will denote by L. We define a function 0 by 0(h) = f(z+h) — f(z) — LA from 
which it follows immediately that 

Equation: 


f(z+h) — f(z) = Lh+ Oh), 


which establishes [link]. To complete the proof that f is differentiable at z, it will suffice 
to establish [link], i.e., to show that 
Equation: 


That is, given € > 0 we must show that there exists a d > 0 such that if 0 < |h| < 6 then 


Equation: 


z+h) — f(z) 
h 


1a (ya =|=! 


L| <e. 


Assuming, without loss of generality, that |h| < a, we have that 
Equation: 


flzt+h) — fz) _ duno An(Z +h)" — dino Anz” 


ee a a lee 
h h 
_ | Xinwo nm eo) 2" 8h") — Lino nz” 
= | 
_ | Zireoan (Cozeo( 2" *H4) ~ 2") 
7 h 
_ | Qunai Gn (Sokaa (,) 2" *h*) L| 
h 
2 oe) ; n n weet 7 oe) ; ae 
te ie 
o) n n ies : 66 ms ms 
= [Xo (35(7)- Enh *) -S( Dew 
= . " ce nm ) 
Sa (S5()- 


Me 


n 


) 
an) (a dla fae 
( 


IA 
= 
Me 


nr n—k, _|k-2 
Jie” lal 


lA 
= 


IA 
> 
Qe 
S 
3 
—— 
~ 3 
Nee 
<< 
3 
zs 
Q 
= 


so that if 6 = «/“, then |(h) /h| < €, whenever |h| < 6, as desired. 


a2? 


REMARK [link] shows that indeed power series functions are differentiable, and in fact 
their derivatives can be computed, just like polynomials, by differentiating term by term. 
This is certainly a result we would have hoped was true, but the proof is not trivial. 


The next theorem, the Chain Rule, is another nontrivial one. It deals with the 
differentiability of the composition of two differentiable functions. Again, the result is 
what we would have wanted, the composition of two differentiable functions is itself 
differentiable, but the argument required to prove it is tricky. 

Chain Rule 


Let f : S + C’bea function, and assume that f is differentiable at a point c. Suppose 
g: I — Cisa function, that T C C, that the number f(c) € T, and that g is 
differentiable at f(c). Then the composition g o f is differentiable at c and 

Equation: 


Using part (3) of [link], write 
Equation: 


9(f (ce) +k) — g(f (c)) = Lgk + Oy (k) 


and 
Equation: 


Fleth) —f (ec) =Lyh + 6; (A). 
We know from that theorem that L, = g’ (f (c)) and Ly = f' (c). And, we also know 


that 
Equation: 


Define a function k(h) = f(¢ + h) — f(c). Then, by [link], we have that 

limp.0 k(h) = 0. We will show that go f is differentiable at c by showing that there 
exists a number L and a function @ satisfying the two conditions of part (3) of [link]. 
Thus, we have that 

Equation: 


go f(c+h)—gof(c) =9(f(ce+h)) — 9(f(c)) 
= Af (c) + k(h)) — 9(f(c)) 
= Lk (h) + 6, (k(A)) 
= Lg(f(e+h) — f (c)) + Oy (k (h)) 
= Lg (Leh + Of (h)) + 6, (k (A) 
= L Leh + Lg6; (hk) + 0, (k(h)). 


We define L = Lgl = g' (f (c)) f’ (c), and we define the function @ by 
Equation: 
0 (h) = LBs (h) + 8g (k(h)). 
By our definitions, we have established [link] 
Equation: 


go f(ct+h)—go f(c) = Lh+ ah), 


so that it remains to verify [link]. 


We must show that, given e > 0, there exists ad > 0 such that if 0 < |h| < 6 then 
|0(h)/h| < €. First, choose an e’ > 0 so that 
Equation: 


IL efiLAe +e Se (4. 3). 


Next, using part (b) of [link], choose a 6’ > 0 such that if |k| < 6’ then |6, (k)| < é’ |kj. 
Finally, choose 6 > 0 so that if 0 < |h| < 6, then the following two inequalities hold. 

|k (h)| < 6’ and |6¢ (h)| < e' |h|. The first can be satisfied because f is continuous at c, 
and the second is a consequence of part (b) of [link]. Then: if 0 < |h| < 6, 

Equation: 


JA(A)| = |Lg95 (h) + Oy (&(R))| 


= [Lg ||O¢ (h)| +10, (& ())| 
< |Zgle’ |h| + €' |k (h)| 
|L,le' |h| +e’ |f (c +h) — f (0) 
|Zgle' |h| + €' |Lyh + 65 (h)| 
< |Zgle’ |h| + €'|Ly||h] + €' |; (A)| 
< [Zale |h| + €'|Lz ||h] + €'e' hl 


o] 


= (|Zo|e’ + JD yle’ +e’) |p 


whence 
Equation: 


| (h)/h| & (|Z|e’ + I,|e +e”) <€, 


as desired. 
Exercise: 


Problem: 


a. 


ee) 


Derive the familiar formulas for the derivatives of the elementary transcendental 
functions: 
Equation: 


! ane; ay ! : ! : 
exp =exp, sin =cos, ,sinh = cosh, cosh =sinh and cos = — sin. 


. Define a function f as follows. f (z) =cos? (z)+ sin? (z). Use part (a) and the 


Chain Rule to show that f’ (z) = 0 for all z € C. Does this imply that 
cos” (z)+ sin” (z) = 1 for all complex numbers 2? 


. Suppose f is expandable in a Taylor series around the point c : 


f(z) = og an(z — c)” for all z € B, (c). Prove that f is differentiable at 
each point of the open disk B, (c), and show that 
Equation: 


f'(2) = Sinag(2— 0)". 


HINT: Use [link] and the chain rule. 


Consequences of Differentiability, the Mean Value Theorem 

The first derivative test for extreme values is proven, followed by the mean value 
theorem, the inverse function theorem, and some exercises pertaining to these 
theorems and proofs. 


Let f : S — Rbea real-valued function of a real variable, and let c be an 
element of the interior of S. Then f is said to attain a local maximum at c if there 
exists a d > 0 such that (c — 6,c + 6) C Sand f(c) > f(z) for all 

ze (c—6,c+9). 


The function f is said to attain a local minimum at c if there exists an interval 
(c — 6,c +6) C S such that f(c) < f(z) forall z € (c— 6,c +4). 


The next theorem should be a familiar result from calculus. 
First Derivative Test for Extreme Values 


Let f : S > R bea real-valued function of a real variable, and let c € S be a point at 
which f attains a local maximum or a local minimum. If f is differentiable at c, then 
f' (c) must be 0. 


We prove the theorem when f attains a local maximum at c. The proof for the case 
when f attains a local minimum is completely analogous. 


Thus, let 6 > 0 be such that f(c) > f(a) for all x such that |” — c| < 6. Note that, if 
n is sufficiently large, then both c + 2 and c — 2 belong to the interval 

(c — 6,c + 6). We evaluate f’ (c) in two ways. First, 

Equation: 


because the numerator is always nonpositive and the denominator is always positive. 
On the other hand, 
Equation: 


since both numerator and denominator are nonpositive. Therefore, f’ (c) must be 0, as 
desired. 


Of course we do not need a result like [link] for functions of a complex variable, since 
the derivative of every real-valued function of a complex variable necessarily is 0, 
independent of whether or not the function attains an extreme value. 


REMARK As mentioned earlier, the zeroes of a function are often important numbers. 
The preceding theorem shows that the zeroes of the derivative f’ of a function f are 
intimately related to finding the extreme values of the function f. The zeroes of f’ are 
often called the critical points for f. Part (a) of the [link] establishes the familiar 
procedure from calculus for determining the maximum and minimum of a continuous 
real-valued function on a closed interval. 

Exercise: 


Problem: 


a. Let f be a continuous real-valued function on a closed interval [a, b], and 
assume that f is differentiable at each point x in the open interval (a, b). Let 
M be the maximum value of f on this interval, and m be its minimum value 
on this interval. Write S for the set of all x € (a,b) for which f' (x) = 0. 
Suppose z is a point of [a, b] for which f(x) is either M or m. Prove that x 
either is an element of the set S, or x is one of the endpoints a or b. 

b. Let f be the function defined on [0, 1/2) by f(t) = ¢/(1 — t). Show that 
f(t) < 1 for allt € [0,1/2). 

c. Let t € (—1/2, 1/2) be given. Prove that there exists an r < 1, depending 
on t, such that |t/(1 + y)| <r for all y between 0 and t. 

d. Let t be a fixed number for which 0 < ¢t < 1. Show that, for allO < s <t, 
(t—s)/(1+s) <t. 


Probably the most powerful theorem about differentiation is the next one. It is stated as 
an equation, but its power is usually as an inequality; i.e., the absolute value of the left 
hand side is less than or equal to the absolute value of the right hand side. 

Mean Value Theorem 


Let f be a real-valued continuous function on a closed bounded interval [a, b], and 
assume that f is differentiable at each point x in the open interval (a, b). Then there 
exists a point c € (a,b) such that 

Equation: 


This proof is tricky. Define a function h on [a, 6] by 
Equation: 


h(x) = #(f(b) — f(a)) — f(x) (6 — a). 


Clearly, h is continuous on [a, b] and is differentiable at each point x € (a, b). Indeed, 
Equation: 


It follows from this equation that the theorem will be proved if we can show that there 
exists a point c € (a,b) for which h’ (c) = 0. Note also that 
Equation: 


h(a) = a( f(b) — fla)) — f(a)(o— a) = af(b) — bf (a) 


and 
Equation: 


showing that h(a) = h(b). 


Let m be the minimum value attained by the continuous function h on the compact 
interval [a, b] and let M/Z be the maximum value attained by h on [a, b]. If m = M, 
then A is a constant on [a, b| and h’ (c) = 0 for all c € (a, b). Hence, the theorem is 
true if M = m, and we could use any c € (a,b). If m 4 M, then at least one of these 
two extreme values is not equal to h(a). Suppose m # h(a). Of course, m is also not 
equal to h(b). Let c € [a, b] be such that h(c) = m. Then, in fact, c € (a, b). By 
[link], h’ (c) = 0. 


We have then that in every case there exists a point c € (a, b) for which h’ (c) = 0. 
This completes the proof. 


REMARK The Mean Value Theorem is a theorem about real-valued functions of a 
real variable, and we will see later that it fails for complex-valued functions of a 
complex variable. (See part (f) of [link].) In fact, it can fail for a complex-valued 
function of a real variable. Indeed, if f(x) = u(x) + iv(z) is a continuous complex- 
valued function on the interval |a, b], and differentiable on the open interval (a, b), 
then the Mean Value Theorem certainly holds for the two real-valued functions wu and 
v, so that we would have 

Equation: 


f (b) — f (a) = u(b) — u(a) +4 (v(b) — v(a)) = w' (er) (b— a) + iv’ (cz) (b—), 


which is not f’ (c) (b — a) unless we can be sure that the two points c, and cz can be 
chosen to be equal. This simply is not always possible. Look at the function 
f (x) = x? + iz? on the interval (0, 1]. 


On the other hand, if f is a real-valued function of a complex variable (two real 
variables), then a generalized version of the Mean Value Theorem does hold. See part 
(c) of [link]. 


One of the first applications of the Mean Value Theorem is to show that a function 
whose derivative is identically 0 is necessarily a constant function. This seemingly 
obvious fact is just not obvious. The next exercise shows that this result holds for 
complex-valued functions of a complex variable, even though the Mean Value 
Theorem does not. 

Exercise: 


Problem: 


a. Suppose f is a continuous real-valued function on (a, b) and that f’ (x) = 0 
for all x € (a,b). Prove that f is a constant function on (a, b). HINT: Show 
that f(z) = f(a) for all x € [a, b] by using the Mean Value Theorem 
applied to the interval |a, 2]. 
b. Let f be a complex-valued function of a real variable. Suppose f is 
differentiable at each point x in an open interval (a, b), and assume that 
f' (x) = 0 for all x € (a,b). Prove that f is a constant function. HINT: Use 
the real and imaginary parts of f. 

. Let f be a complex-valued function of a complex variable, and suppose that 
f is differentiable on a disk B, (c) C C, and that f’ (z) = 0 for all 
z € B,(c). Prove that f(z) is constant on B, (c). HINT: Let z be an 
arbitrary point in B, (c), and define a function h : [0,1] + C by 
h(t) = f((1 — t)c + tz). Apply part (b) to h. 


io) 


The next exercise establishes, at last, two important identities. 
Exercise: 


Problem: (cos? + sin?= 1 and exp (i = —1.) 


a. Prove that cos” (z)+ sin? (z) = 1 for all complex numbers z. 


ie) 


oo 


. Prove that cos (7) = —1. HINT: We know from part (a) that cos (7) = +1. 


Using the Mean Value Theorem for the cosine function on the interval (0, 7], 
derive a contradiction from the assumption that cos (7) = 1. 

Prove that exp (i) = —1. HINT: Recall that 

exp (iz) =cos (z) +7 sin (z) for all complex z. (Note that this does not yet 
tell us that e*” = —1. We do not yet know that exp (z) = e”.) 

Prove that cosh? z— sinh? z = 1 for all complex numbers z. 


. Compute the derivatives of the tangent and hyperbolic tangent functions 


tan=sin / cos and tanh=sinh / cosh. Show in fact that 
Equation: 


1 
tan’ = 5 and tanh’= re 
cos cosh 


Here are two more elementary consequences of the Mean Value Theorem. 
Exercise: 


Problem: 


a. 


ie) 


d. 


Suppose f and g are two complex-valued functions of a real (or complex) 
variable, and suppose that f’ (x) = g’ (x) for all x € (a,b) (or x € B, (c).) 
Prove that there exists a constant & such that f(a) = g(x) + k for all 

x € (a,b) (oraz € B, (c).) 


. Suppose f’ (z) = c exp (az) for all z, where c and a are complex constants 


with a # 0. Prove that there exists a constant c’ such that 

f (z) = © exp (az) + c’. What if a = 0? 

(A generalization of part (a)) Suppose f and g are continuous real-valued 
functions on the closed interval [a, b], and suppose there exists a partition 
{xq < 21 <... < @y} of [a,b] such that both f and g are differentiable on 
each subinterval (x;-1, 2;). (That is, we do not assume that f and g are 
differentiable at the endpoints.) Suppose that f’ (2) = g! (x) for every x in 
each open subinterval (x;_1, 2;). Prove that there exists a constant & such 
that f(a) = g(x) + k for all x € [a, b]. HINT: Use part (a) to conclude that 
f =9+h where h is a step function, and then observe that h must be 
continuous and hence a constant. 

Suppose f is a differentiable real-valued function on (a, b) and assume that 
f' (a) 4 0 for all x € (a, b). Prove that f is 1-1 on (a, b). 


Exercise: 


Problem: 


Let f : [a,b] > R bea function that is continuous on its domain [a, b] and 
differentiable on (a, b). (We do not suppose that f’ is continuous on (a, b).) 


a. Prove that f is nondecreasing on |a, 6] if and only if f’ (x) > 0 for all 
x € (a,b). Show also that f is nonincreasing on |a, b] if and only if 
f' (x) < 0 for all x € (a, 5). 

b. Conclude that, if f’ takes on both positive and negative values on (a, b), 

then f is not 1-1. (See the proof of [link].) 

. Show that, if f’ takes on both positive and negative values on (a, b), then 
there must exist a point c € (a, b) for which f’ (c) = 0. (If f’ were 
continuous, this would follow from the Intermediate Value Theorem. But, we 
are not assuming here that f’ is continuous.) 

d. Prove the Intermediate Value Theorem for Derivatives: Suppose f is 
continuous on the closed bounded interval [a, b] and differentiable on the 
open interval (a, b). If f’ attains two distinct values 
v1 = f' (x1) < ve = f’ (x2), then f’ attains every value v between v; and 
vg. HINT: Suppose v is a value between v, and v2. Define a function g on 
|a, b] by g(x) = f(x) — vx. Now apply part (c) to g. 


io) 


Here is another perfectly reasonable and expected theorem, but one whose proof is 
tough. 
Inverse Function Theorem 


Suppose f : (a,b) — Risa function that is continuous and 1-1 from (a, b) onto the 
interval (a’, b’). Assume that f is differentiable at a point c € (a, b) and that 

f'(c) £0. Then f~ is differentiable at the point f(c), and 

Equation: 


The formula f~!’ (f (c)) = 1/f' (c) is no surprise. This follows directly from the 
Chain Rule. For, if f~ (f (x)) = x, and f and f~* are both differentiable, then 

f7l' (f£ (c))f' (c) = 1, which gives the formula. The difficulty with this theorem is in 
proving that the inverse function f~+ of f is differentiable at f(c). In fact, the first 
thing to check is that the point f(c) belongs to the interior of the domain of f 1 for 
that is essential if f—! is to be differentiable there, and here is where the hypothesis 


that f is a real-valued function of a real variable is important. According to [link], the 
1-1 continuous function f maps [a, 6] onto an interval [a’, b’], and f(c) is in the open 
interval (a’, b’), i.e., is in the interior of the domain of f~!. 


According to part (2) of [link], we can prove that f~* is differentiable at f(c) by 
showing that 
Equation: 


That is, we need to show that, given an € > 0, there exists a 6 > 0 such that if 
0 < |x — f(c)| < 6 then 
Equation: 


First of all, because the function 1/q is continuous at the point f’ (c), there exists an 
€’ > Osuch that if |g—f’ (c)| < e’, then 
Equation: 


Next, because f is differentiable at c, there exists a 6’ > 0 such that if 
0 < |y—c| < 6’ then 
Equation: 


—f'(c)| <eé. 


Now, by [link], f~! is continuous at the point f(c), and therefore there exists a d > 0 
such that if |e — f(c)| < 6 then 
Equation: 


If (@) — fF (FO < 6 


So, if |a — f(c)| < 6, then 
Equation: 


|f-! (e) — el =|f- 1 (2) — FTF (0) < &. 


But then, by [link], 


Equation: 
ie) cee 
Flgyce fF Ol<e 
from which it follows, using [link], that 
Equation: 
f(a) - F*(F (0) me 
T— 7c) i 
as desired. 


REMARK A result very like [link] is actually true for complex-valued functions of a 
complex variable. We will have to show that if c is in the interior of the domain S of a 
one-to-one, continuously differentiable, complex-valued function f of a complex 
variable, then f(c) is in the interior of the domain f($) of f~*. But, in the complex 
variable case, this requires a somewhat more difficult argument. Once that fact is 
established, the proof that f~! is differentiable at f(c) will be the same for complex- 
valued functions of complex variables as it is here for real-valued functions of a real 
variable. Though the proof of [link] is reasonably complicated for real-valued 
functions of a real variable, the corresponding result for complex functions is much 
more deep, and that proof will have to be postponed to a later chapter. See [link]. 


The Exponential and Logarithm Functions 

We derive next the elementary properties of the exponential and logarithmic 
functions. Of course, by “exponential function,” we mean the power series 
function exp. And, as yet, we have not even defined a logarithm function. 


We derive next the elementary properties of the exponential and logarithmic 
functions. Of course, by “exponential function,” we mean the power series 
function exp. And, as yet, we have not even defined a logarithm function. 
Exercise: 


Problem: 


a. Define a complex-valued function f : C — C by 
f(z) =exp (z) exp (—z). Prove that f(z) = 1 forall z € C. 

b. Conclude from part (a) that the exponential function is never 0, and 
that exp (—z) = 1/ exp (z). 

c. Show that the exponential function is always positive on R, and that 
limi 2 exp (an) 0, 

d. Prove that exp is continuous and 1-1 from (—©oo, co) onto (0, 00). 

Show that the exponential function is not 1-1 on C. 

Use parts b and e to show that the Mean Value Theorem is not in any 

way valid for complex-valued functions of a complex variable. 


rh oD 


Using part (d) of the preceding exercise, we make the following important 
definition. 


We call the inverse exp~! of the restriction of the exponential function to R 
the (natural) logarithm function, and we denote this function by In. 


The properties of the exponential and logarithm functions are strongly tied to the 
simplest kinds of differential equations. The connection is suggested by the fact, 
we have already observed, that exp’=exp. The next theorem, corollary, and 
exercises make these remarks more precise. 


Suppose f : C — Cis differentiable everywhere and satisfies the differential 
equation f’ = af, where a is a complex number. Then f(z) = c exp (az), 
where c = f(0). 


Consider the function h(z) = f(z)/ exp (az). Using the Quotient Formula, we 

have that 

Equation: 

fy i OO) Gs) eee ONT) EN) ag: 
[exp (az)]? [exp (z)|° 


Hence, there exists a complex number c such that h(z) = c for all z. Therefore, 
f(z) =c exp (az) for all z. Setting z = 0 gives f(0) = c, as desired. 
Law of Exponents 


For all complex numbers z and w,exp (z + w) =exp (z) exp (w). 


Fix w, define f(z) =exp (z+ w), and apply the preceding theorem. We have 
f' (z) =exp (z+ w) = f (2), so we get 
Equation: 


exp (z+ w) = f(z) = f(0) exp (z) =exp (w) exp (z). 


Exercise: 


Problem: 


a. If n is a positive integer and z is any complex number, show that 
exp (nz) = (exp (z))”. 
b. If r is a rational number and z is any real number, show that 


exp (rx) = (exp (x))". 


Exercise: 


Problem: 


a. Show that In is continuous and 1-1 from (0, oo) onto R. 

b. Prove that the logarithm function In is differentiable at each point 
y € (0, 00) and that In’ (y) = 1/y. HINT: Write y =exp (c) and use 
Theorem 4.10. 

c. Derive the first law of logarithms: In (zy) =In (x)+ In (y). 

d. Derive the second law of logarithms: That is, if r is a rational number 
and x is a positive real number, show that In (2”) = r In (2). 


We are about to make the connection between the number e and the exponential 
function. The next theorem is the first step. 


In (1) = Oand In (e) = 1. 


If we write 1 =exp (t), then t =In (1). But exp (0) = 1, so that In (1) = 0, 
which establishes the first assertion. 


Recall that 
Equation: 


Therefore, 
Equation: 


In (e) 


= le(4)) 
- in ((12)) 


1 
= limnIln (1+ =] 
n n 


This establishes the second assertion of the theorem. 
Exercise: 


Problem: 


a. Prove that 
Equation: 


HINT: Use the fact that the logarithm function is 1-1. 

b. For r a rational number, show that exp (r) = e’. 

c. If a is a positive number and r = p/q is a rational number, show that 
Equation: 


a” =exp (r In (a)). 


d. Prove that e is irrational. HINT: Let pp /qn be the nth partial sum of 
the series in part (a). Show that g, < n!, and that 
lim gn (€ — Pn/Gn) = 0. Then use [link]. 


We have finally reached a point in our development where we can make sense of 
raising any positive number to an arbitrary complex exponent. Of course this 
includes raising positive numbers to irrational powers. We make our general 
definition based on part (c) of the preceding exercise. 


For a a positive real number and z an arbitrary complex number, define a* 
by 
Equation: 


a* =exp (z In (a)). 


REMARK The point is that our old understanding of what a” means, where 

a > 0 and r isa rational number, coincides with the function exp (r In (a)). So, 
this new definition of a* coincides and is consistent with our old definition. And, 
it now allows us to raies a positive number a to an arbitrary complex exponent. 


REMARK Let the bugles sound!! Now, having made all the appropriate 
definitions and derived all the relevant theorems, we can finally prove that 


e’” — —]. From the definition above, we see that if a = e, then we have 
e* =exp (z). Then, from part (c) of [link], we have what we want: 


Equation: 
e™ =—-1 
Exercise: 
Problem: 
a. Prove that, for all complex numbers z and w,e**”’ = e%e”. 
b. If z is a real number and z is any complex number, show that 
Equation: 
(e”)* _ e* 
c. Let a be a fixed positive number, and define a function f : C — C by 
f (z) = a’. Show that f is differentiable at every z € C and that 
f' (z) =In (a)a’. 
d. Prove the general laws of exponents: If a and b are positive real 
numbers and z and w are complex numbers, 
Equation: 
are _— a’*a” 
y] 
Equation: 
a*b* = (ab)*, 
and, if z is real, 
Equation: 
gt = far)” 
e. If y is a real number, show that je |= 1. Ifz=2 + ty is a complex 
number, show that |e*|= e”. 
f. Let a = a + bi be a complex number, and define a function 
f : (0,00) > Cby f (x) = «* = e®™©), Prove that f is 
differentiable at each point x of (0, 00) and that f’ (x) = aa*!. 
g. Leta = a+ bi be as in part (f). For x > 0, show that |x°|= 2°. 


The Trigonometric and Hyperbolic Functions 
Two theorems covering differentiation of trigonometric and hyperbolic functions, including 
practice exercises corresponding to the theorems. 


The laws of exponents and the algebraic connections between the exponential function and 
the trigonometric and hyperbolic functions, give the following “addition formulas:” 


The following identities hold for all complex numbers z and w. 


Equation: 
sin (z + w) =sin (z) cos (w)+ cos (z) sin (w). 
Equation: 
cos (z + w) =cos (z) cos (w)— sin (z) sin (w). 
Equation: 
sinh (z+ w) =sinh (z) cosh (w)+ cosh (z) sinh (w). 
Equation: 


cosh (z+ w) =cosh (z) cosh (w)+ sinh (z) sinh (w). 


We derive the first formula and leave the others to an exercise. 


First, for any two real numbers x and y, we have 
Equation: 
cos (a+y)+isin(rt+y) = el 


e'tely 


(cos x+isin x) x (cos y+7sin y) 
= cos x cos y— sin x sin y+ i(cos z sin y+ sin x cos y), 


which, equating real and imaginary parts, gives that 
Equation: 
cos (x + y) =cos x cos y— sin x sin y 
and 
Equation: 


sin (x + y) =sin x cos y+ cos z sin y. 


The second of these equations is exactly what we want, but this calculation only shows that 
it holds for real numbers z and y. We can use the Identity Theorem to show that in fact this 
formula holds for all complex numbers z and w. Thus, fix a real number y. Let 

f(z) =sin z cos y+ cos z sin y, and let 

Equation: 


1 : : 
g(2) <sin (2-49) = 35 (ei? ete = 


Then both f and g are power series functions of the variable z. Furthermore, by the previous 
calculation, f(1/k) = g(1/k) for all positive integers k. Hence, by the Identity Theorem, 
f(z) = g(z) for all complex z. Hence we have the formula we want for all complex 
numbers z and all real numbers y. 


To finish the proof, we do the same trick one more time. Fix a complex number z. Let 
f(w) =sin z cos w+ cos z sin w, and let 
Equation: 


1 : . 
gu) =sin (2+-u) = 3-(et) ete) = 


Again, both f and g are power series functions of the variable w, and they agree on the 
sequence {1/k}. Hence they agree everywhere, and this completes the proof of the first 
addition formula. 

Exercise: 


Problem: 
a. Derive the remaining three addition formulas of the preceding theorem. 


b. From the addition formulas, derive the two “half angle” formulas for the 
trigonometric functions: 


Equation: 
1— 2 
sin? (z) = pac za ‘ 
2 
and 
Equation: 
1 2 
eres ees: 


2 


The trigonometric functions sin and cos are periodic with period 27; i.e., 
sin (z + 27) =sin (z) and cos (z + 27) =cos (z) for all complex numbers z. 


We have from the preceding exercise that 

sin (z+ 27) =sin (z) cos (27)+ cos (z) sin (277), so that the periodicity assertion for the 
sine function will follow if we show that cos (27) = 1 and sin (27) = 0. From part (b) of 
the preceding exercise, we have that 

Equation: 


1— 2 
0 =sin’ (7) = +008 Cn) 


which shows that cos (277) = 1. Since cos” + sin?= 1, it then follows that sin (27) = 0. 


The periodicity of the cosine function is proved similarly. 
Exercise: 


Problem: 


a. Prove that the hyperbolic functions sinh and cosh are periodic. What is the 
period? 

b. Prove that the hyperbolic cosine cosh (a) is never 0 for x a real number, that the 
hyperbolic tangent tanh (2) =sinh (x)/ cosh (x) is bounded and increasing 
from R onto (—1, 1), and that the inverse hyperbolic tangent has derivative given 
by tanh’ (y) =1/ (1 —-y’). 

c. Verify that for all y € (—1, 1) 

Equation: 


Exercise: 
Polar coordinates 


Problem: 


Let z be a nonzero complex number. Prove that there exists a unique real number 
0 < @ < 2m such that z = re”, where r = |z|. 


HINT: If z = a + bi, then z = r(# + $4. Observe that T= 2s Tea d< : < I, 
and (@y + (2) = 1. Show that there exists a unique 0 < @ < 27 such that 


i =cos @ and fh =sin 0. 


L'Hopital's Rule 
The caught mean value theorem and L'Hopital's rule are included in this module, 
accompanied by practice exercises. 


Many limits of certain combinations of functions are difficult to evaluate because 
they lead to what's known as “indeterminate forms.” These are expressions of the 
form 0/0,00 /00,0°,00 — 00,1%, and the like. They are precisely combinations of 
functions that are not covered by our limit theorems. See [link]. The very definition 
of the derivative itself is such a case: lim, 49 (f (ce + h) — f (c)) = 0, 

limp_59 h = 0, and we are interested in the limit of the quotient of these two 
functions, which would lead us to the indeterminate form 0/0. The definition of 
the number e is another example: lim (1 + 1/n) = 1,lim n = oo, and we are 
interested in the limit of (1 + 1/n)"”, which leads to the indeterminate form 1°. 
L'Hopital's Rule, [link] below, is our strongest tool for handling such indeterminate 
forms. 


To begin with, here is a useful generalization of the Mean Value Theorem. 
Cauchy Mean Value Theorem 


Let f and g be continuous real-valued functions on a closed interval |a, b], suppose 
g(a) # g(b), and assume that both f and g are differentiable on the open interval 
(a,b). Then there exists a point c € (a, b) such that 

Equation: 


f(b) — fla) _ fle) 
g(b)— g(a) —g’(c) 


Exercise: 


Problem: Prove the preceding theorem. 


HINT: Define an auxiliary function h as was done in the proof of the original 
Mean Value Theorem. 


The following theorem and exercise comprise what is called L'Hopital's Rule. 


Suppose f and g are differentiable real-valued functions on the bounded open 
interval (a, b) and assume that 
Equation: 


t—a+0 g/ (x) 


where L is a real number. (Implicit in this hypothesis is that g’ (2) ¢ 0 for z in 
some interval (a,a + a).) Suppose further that either 


Equation: 
a f (x) =a g (x) =o 
or 
Equation: 
Bey f (x) meee g (x) = ON 
then 
Equation: 


f(x) 


oat0 g(x) 


Suppose first that 
Equation: 
lim f(z) = lim g(a) =0. 


xz—a+0 z—a+0 


Observe first that, because g’ (a) 4 0 for all x in some interval (a, a + a),g' (x) is 
either always positive or always negative on that interval. (This follows from part 
(d) of [link].) Therefore the function g must be strictly monotonic on the interval 
(a,a + a). Hence, since lim;_,a+0 g(x) = 0, we must have that g(a) 4 0 on the 
interval (a,a + aq). 


Now, given an € > 0, choose a positive 6 < a@ such that ifa < ec < a+ then 
f'(e) 
g(c) 
a<x<a+d, we have by the Cauchy Mean Value Theorem that there exists a 
point c between a + 1/n and x such that 


— L| < e. Then, for every natural number n for which 1/n < 6, and every 


Equation: 


—L|<e. 


g(x) — g(a + 1/n) g (c) 


Therefore, taking the limit as m approaches oo, we obtain 
Equation: 


= € 


2) — 
g(x) B= lie, 


| f(x) — f(a+1/n) 
g(x) — g(a + 1/n) 


for all x for which a < x < a+ 0. This proves the theorem in this first case. 


Next, suppose that 
Equation: 


an f (x) ean g (x) pane 


This part of the theorem is a bit more complicated to prove. First, choose a positive 
a so that f(x) and g(x) are both positive on the interval (a, a + a). This is 
possible because both functions are tending to infinity as z approaches a. Now, 
given an € > O, choose a positive number 8 < a@ such that 

Equation: 


2 


for alla <c < a+. We express this absolute value inequality as the following 
pair of ordinary inequalities: 
Equation: 


€ 
<L+—. 
_ 


Set y = a+ (. Using the Cauchy Mean Value Theorem, and the preceding 
inequalities, we have that for alla <x < y 


Equation: 


implying that 
Equation: 


(Z- 5) ((@) —9(y)) + FW) < F@) < (L+ 5) (ole) - 9) + FY. 


Dividing through by g(a) and simplifying we obtain 


Equation: 
—e (L-$)9ly) , fy) — f@) e (L+s)9(y) . fy) 
e237 giz) * ga) Soa) ST 2 gle) le) 


Finally, using the hypothesis that lim, .,19 g(a) = oo, and the fact that 
L, €, g(y), and f(y) are all constants, choose a 6 > 0, with 6 < 6, such that if 
a<«x<a+d6, then 


Equation: 
(L—+)o(y) — f(y) 
| + ec £ 
9(z) g(x) 
and 
Equation: 
(L+$)9(y) — f(y) 
|- pivie= 
g(x) (x) 
Then, for alla < x < a+, we would have 
Equation: 
L-e< Ae) <L+e, 


implying that 
Equation: 


and the theorem is proved. 
Exercise: 


Problem: 
a. Show that the conclusions of the preceding theorem also hold if we 


assume that 
Equation: 


f' (@) 


Patent, g’ (x) 


HINT: Replace e€ by a large real number B and show that 
f(x)/g(z) > BifO<x2-a<o. 

b. Show that the preceding theorem, as well as part (a) of this exercise, also 
holds if we replace the (finite) endpoint a by —oo. HINT: Replace the 6's 
by negative numbers B. 

c. Show that the preceding theorem, as well as parts a and b of this 
exercise, hold if the limit as x approaches a from the right is replaced by 
the limit as x approaches b from the left. HINT: Replace f(x) by f(—a) 
and g(x) by g(—2). 

d. Give an example to show that the converse of L'Hopital's Rule need not 
hold; i.e., find functions f and g for which 
limz+a+0 f(z) =limz+a+0 g(x) = 0, 

Equation: 


does not exist. 


! 
(2) exists, but lim f(z) 
xz—a+0 g(x) t—a+0 g! xr) 


e. Deduce from the proof given above that if lim, .a;09 f’ (x)/g' (x) = L 
and lim;-,a10 g(x) = oo, then lim, 440 f ()/g (x) = L independent 
of the behavior of f. 


f. Evaluate lim,_,., z!/*, and lim, 49 (1 — a)! *. HINT: Take logarithms. 


Higher Order Derivatives 
A series of definitions of terminology used to describe higher-order 
derivatives, and a formula for the coefficients of a taylor series function. 


Let S be a subset of R (or C), and Let f : S + C'bea function of a 
real (or complex) variable. We say that f is continuously differentiable 
on $° if f is differentiable at each point z of S° and the function f’ is 
continuous on S°. We say that f € C1(S) if f is continuous on S$ and 
continuously differentiable on S°. We say that f is 2-times 
continuously differentiable on S° if the first derivative f’ is itself 
continuously differentiable on S°. And, inductively, we say that f is k- 
times continuously differentiable on S° if the k — 1st derivative of f is 
itself continuously differentiable on S°. We write f‘*) for the kth 
derivative of f, and we write f € C*(S) if f is continuous on S' and 
is k times continuously differentiable on S°. Of course, if f € C*(S), 
then all the derivatives f (3) for 7 < k, exist nd are continuous on S . 
(Why?) 


For completeness, we define f (0) to be f itself, and we say that 

f € C®™ (S) if f is continuous on Sand has infinitely many 
continuous derivatives on S v. i.e., all of its derivatives exist and are 
continuous on 9°. 


As in [link], we say that f is real-analytic (or complex-analytic) on S 
if it is expandable in a Taylor series around each point c € S® 


REMARK Keep in mind that the definition above, as applied to functions 
whose domain S is a nontrivial subset of C’, has to do with functions of a 
complex variable that are continuously differentiable on the set S°. We 
have seen that this is quite different from a function having continuous 
partial derivatives on S°. We will return to partial derivatives at the end of 
this chapter. 


Let S be an open subset of R (or C). 


1. Suppose W'S is a subset of R. Then, for each k > 1, there exists a 
function in C* ($) that is not in C*** (S$). That is, C*' (S) isa 


proper subset of C* (iS). 

2. If f is real-analytic (or complex-analytic) on S, then f € C'™(S). 

3. There exists a function in C™ (R) that is not real-analytic on R. That 
is, the set of real-analytic functions on R is a proper subset of the set 


C™ (R). 


REMARK Suppose § is an open subset of C’. It is a famous result from the 
Theory of Complex Variables that if f is in C’ (9), then f is necessarily 
complex analytic on S. We will prove this amazing result in [link]. Part (3) 
of the theorem shows that the situation is quite different for real-valued 
functions of a real variable. 


For part (1), see the exercise below. Part (2) is immediate from part (c) of 
[link]. Before finishing the proof of part (3), we present the following 
lemma: 


Let f be the function defined on all of R as follows. 
Equation: 


f(a) = joe < oP Es 9 


where p(z) is a fixed polynomial function and n is a fixed nonnegative 
integer. Then f is continuous at each point x of R. 


The assertion of the lemma is clear if « ~ 0. To see that f is continuous at 
O, it will suffice to prove that 
Equation: 


(Why?) But, for x > 0, we know from part (b) of [link] that 
e'/® > 1/(2"*" (n + 1)!), implying that e~!/* < 2" (n + 1)!. Hence, 
for 2 > 0, 


Equation: 


and this tends to 0 as x approaches 0 from the right, as desired. 


Returning to the proof of [link], we verify part (3) by observing that if f is 
as in the preceding lemma then f is actually differentiable, and its 
derivative f’ is a function of the same sort. (Why?) It follows that any such 
function belongs to C® (R). On the other hand, a nontrivial such f cannot 
be expandable in a Taylor series around 0 because of the Identity Theorem. 
(Take x, = —1/k.) This completes the proof. 

Exercise: 


Problem: 


a. Prove part (1) of [link]. Use functions of the form zx” sin (1/z). 

b. Prove that any function of the form of the f in the lemma above is 
everywhere differentiable on R, and its derivative has the same 
form. Conclude that any such function belongs to C'® (R). 

c. For each positive integer n, define a function f,, on the interval 
(—1,1 by fn (x) = |a|'*’/”. Prove that each f, is differentiable 
at every point in (—1, 1), including 0. Prove also that the 
sequence { f,,} converges uniformly to the function f(x) = |z|. 
(See part (h) of [link].) Conclude that the uniform limit of 
differentiable functions of a real variable need not be 
differentiable. (Again, for functions of a complex variable, the 
situation is very different. In that case, the uniform limit of 
differentiable functions is differentiable. See [link].) 


Exercise: 
A smooth approximation to a step function. 


Problem: 


Suppose a < 6 < c < dare real numbers. Show that there exists a 
function x in C°° (R) such that 0 < x(a) < 1 for all x,x(x) = 1 for 
x € [b,c], and x(x) = 0 for x ¢ (a, d). (If ais close to b and c is 
close to d, then this function is a C™ approximation to the step 
function that is 1 on the interval |b, c] and 0 elsewhere.) 


a. Let f be a function like the one in the lemma. Think about the 
graphs of the functions f(a — c) and f(b — x). Construct a C'* 
function g that is 0 between 6 and c and positive everywhere else. 

b. Construct a C’™ function h that is positive between a and d and 0 
everywhere else. 

c. Let g and h be as in parts (a) and (b). If 7 = g +h, show that 7 is 
never 0, and write & for the C'® function k = 1/7. 

d. Examine the function hk, and show that it is the desired function 
x. 


Formula for the coefficients of a Taylor Series function 


Let f be expandable in a Taylor series around a point c : 
Equation: 


Fiz) = > An(xz —c)”. 


Then for each n,an = f\” (c)/nl. 


Because each derivative of a Taylor series function is again a Taylor series 
function, and because the value of a Taylor series function at the point c is 
equal to its constant term ao, we have that a; = f’ (c). Computing the 
derivative of the derivative, we see that 2a2 = f” (c) = f®) (c). 
Continuing this, i.e., arguing by induction, we find that n!a,, = f (c), 
which proves the theorem. 


Taylor Polynomials and Taylor's Remainder Theorem 
This module contains taylor's remainder theorem, some remarks about 
Taylor series, and a test for local maxima and minima. 


Let f be in C” (B, (c)) for ca fixed complex number, r > 0, andna 
positive integer. Define the Taylor polynomial of degree n for f at c to 


be the polynomial 7’ = Ti °) given by the formula: 


Equation: 


(Ti) (z) = 3 a,(2=<)’, 


where a; = f) (c)/j!. 


REMARK If f is expandable in a Taylor series on B,. (c), then the Taylor 
polynomial for f of degree n is nothing but the nth partial sum of the 
Taylor series for f on B, (c). However, any function that is n times 
differentiable at a point c has a Taylor polynomial of order n. Functions that 
are infinitely differentiable have Taylor polynomials of all orders, and we 
might suspect that these polynomials are some kind of good approximation 
to the function itself. 

Exercise: 


Problem: 


Prove that f is expandable in a Taylor series function around a point c 
(with radius of convergence r > OQ) if and only if the sequence 


ee 7, of Taylor polynomials converges pointwise to f; i.e., 


Equation: 


for all z in B, (c). 


Exercise: 


Problem: 

Let f € C” (B, (c)). Prove that f’ ¢ C" + (B, (c)). Prove also that 
/ 

(Ta) = Tha: 


The next theorem is, in many ways, the fundamental theorem of 
numerical analysis. It clearly has to do with approximating a general 
function by polynomials. It is a generalization of the Mean Value 
Theorem, and as in that case this theorem holds only for real-valued 
functions of a real variable. 


Taylor's Remainder Theorem 


Let f be a real-valued function on an interval (c — r,c +r), and assume 
that f € OC" ((e—r,c+r)), and that f is differentiable on 
(c—r,c+r). Then, for each x in (c — r,c +r) there exists a y between c 
and x such that 

Equation: 


Hi ‘ake (y) n+1 
f (x) — (Té,.) (x) = aor ao ae (4. 7) 


REMARK If we write f (z) = T/,) (v)+Rn+1 (@), where Rn+1 (2) is 


the error or remainder term, then this theorem gives a formula, and hence an 
estimate, for that remainder term. This is the evident connection with 
Numerical Analysis. 


We prove this theorem by induction on n. For n = Q, this is precisely the 
Mean Value Theorem. Thus, 
Equation: 


f (x) —T;.. (e) = f(2)- fl) =f' (y)(@-«. 


Now, assuming the theorem is true for all functions in 
C”!((e—r,c+7r)), let us show it is true for the given function 


fec"((c—r,c+r)). Set g(x) = f(x) - (Te) (x) and let 

h (x) = (a —c)"*. Observe that both g(c) = 0 and A(c) = 0. Also, if 
x #c, then h(x) 4 0. So, by the Cauchy Mean Value Theorem, we have 
that 

Equation: 


)  g(z)-g(e) — g'(w) 


(c) A(a)—h(c) hh! (w) 


So) 
— 
8 


— 


for some w between c and x. Now 
Equation: 


o! (w) =F! (w) — (fy) (w) =F" (w) — (Teed) (ew) 


(See the preceding exercise.), and h’ (w) = (n+ 1)(w — c)”. Therefore, 
Equation: 


fe) (Ma) _— 


(a —c)""* h(x) 
_ gw) 
h' (w) 
f' (w) — (Tee) (w) 
~ (n+ 1)(w—o)” 


We apply the inductive hypotheses to the function f’ (which is in 
C”™1 ((e —r,ce + r))) and obtain 
Equation: 


f(e)- (Te) @) Fw) - (75-3) @) 
(a —c)"" (n+ 1)(w—e)" 


r(n) 
iy) (w—c)” 


ni 


(n+1)(w—c)" 
FG) 


(n+ 1)! 


ORG) 
(n+ 1)! 


for some y between c and w. But this implies that 
Equation: 


‘i i UVeso 
f(z) - (Té,.)) (x) = (n+ ” 


for some y between c and z, which finishes the proof of the theorem. 
Exercise: 


Problem: 


Define f(x) = 0 for x < Oand f (z) =e 1/* for x > 0. Verify that 
f © C®™(R), that f( (0) = 0 for all n, and yet f is not expandable 
in a Taylor series around 0. Interpret Taylor's Remainder Theorem for 
this function. That is, describe the remainder R,,+1 (2). 


As a first application of Taylor's Remainder Theorem we give the following 
result, which should be familiar from calculus. It is the generalized version 
of what's ordinarily called the “second derivative test.” 

Test for Local Maxima and Minima 


Let f be a real-valued function in C” (c — r,c +r), suppose that the n + 1 
st derivative f("+) of f exists everywhere on (c — r,c +r) and is 


continuous at c, and suppose that f (h) (c) = 0 for all 1 < k < nand that 
f°» (c) 4 0. Then: 


1. If n is even, f attains neither a local maximum nor a local minimum at 
c. In this case, c is called an inflection point. 

2. If n is odd and f("+ (c) < 0, then f attains a local maximum at c. 

3. If n is odd and f'"*) (c) > 0, then f attains a local minimum at c. 


Since f! ("+1) is continuous at c, there exists a 6 > O such that i kel) ) (y) 
has the same sign as f("+) (ce) for all y € (ec — 6,c +5). We have by 
Taylor's Theorem that if x € (c — 6,c + 6) then there exists a y between x 
and c such that 


Equation: 
F(2) = (Tho) (e+ Wear, 
from which it follows that 
Equation: 
Ke) fl) = Os mle of + Feo 
= r wo ent 


Suppose n is even. It follows then that if x < c, the sign of (a — eye is 
negative, so that the sign of f(a) — f(c) is the opposite of the sign of 
f"* (ce). On the other hand, if a > c, then (x — c)"*' > 0, so that the 
sign of f(x) — f(c) is the same as the sign of f("*) (ec). So, re Vefle) 
for all nearby x on one side of c, while f(a) < f(c) for all nearby x on the 
other side of c. Therefore, f attains neither a local maximum nor a local 
minimum at c. This proves part (1). 


Now, if n is odd, the sign of f(a) — f(c) is the same as the sign of 

f* (y), which is the same as the sign of f("*») (c), for all 

x € (c—6,c+6). Hence, if f+) (c) < 0, then f(x) — f(c) < 0 for all 
xz € (c— 6,c + 4), showing that f attains a local maximum at c. And, if 
f'"*» (c) > 0, then the sign of f(x) — f(c) is positive for all 

x € (c— 6,c +4), showing that f attains a local minimum at c. This 
proves parts (2) and (3). 


The General Binomial Theorem 

We use Taylor's Remainder Theorem to derive a generalization of the 
Binomial Theorem to nonintegral exponents. First we must generalize the 
definition of binomial coefficient. 


We use Taylor's Remainder Theorem to derive a generalization of the 
Binomial Theorem to nonintegral exponents. First we must generalize the 
definition of binomial coefficient. 


Let a be a complex number, and let k be a nonnegative integer. We 
define the general binomial coefficient (¢ ) by 
Equation: 


@ _ a(a — ae —k+1) 


If a is itself a positive integer and k < a, then (7) agrees with the 
earlier definition of the binomial coefficient, and (7) = 0 when 

k > a. However, if a is not an integer, but just an arbitrary complex 
number, then every Ce) al); 


Exercise: 


Problem: 


Estimates for the size of binomial coefficients. Let a be a fixed 
complex number. 


a. Show that 
Equation: 


for all nonnegative integers k. HINT: Note that 
Equation: 


(a) I< |a|(jalpha| + 1)(|alpha| + 2)...(ja| +k — 1) 


k k! 
b. Use part (a) to prove that there exists a constant C’ such that 
Equation: 
a 
<2" 
(Is 
for all nonnegative integers k. HINT: Note that (1 + |a|/7) < 2 
for all 7 > |al. 
c. Show in fact that for each e > 0 there exists a constant C’. such 
that 
Equation: 


(Qicea+e 


for all nonnegative integers k. 
d. Let h(t) be the power series function given by 
h(t) = 37-9 ({)t*. Use the ratio test to show that the radius of 


convergence for h equals 1. 


REMARK The general Binomial Theorem, if there is one, should be 
something like the following: 
Equation: 


The problem is to determine when this infinite series converges, i.e., for 
what values of the three variables x, y, and a does it converge. It certainly 
is correct if x = 0, so we may as well assume that x 4 0, in which case we 
are considering the validity of the formula 

Equation: 


(c+y)* =2°%(14+t)% = 2 (7) 


where t = y/z. Therefore, it will suffice to determine for what values of t 
and @ does the infinite series 


Equation: 
(Ot 
(i) 
k=0 
equal 
Equation: 


G2i*. 


The answer is that, for n arbitrary complex number a, this series converges 
to the correct value for all ¢ € (—1, 1). (Of course, t must be larger than —1 
for the expression (1 + aie even to be defined.) However, the next theorem 
only establishes this equality for t's in the subinterval (—1/2, 1/2). As 
mentioned earlier, its proof is based on Taylor's Remainder Theorem. We 
must postpone the complete proof to [link], where we will have a better 
version of Taylor's Theorem. 


Let a = a + bi be a fixed complex number. Then 
Equation: 


for allt € (—1/2,1/2). 


Of course, this theorem is true if a is a nonnegative integer, for it is then 
just the original Binomial Theorem, and in fact in that case it holds for 


every complex number t. For a general complex number a, we have only 
defined x° for positive z's, so that (1 + t)® is not even defined for t < —1. 


Now, for a general a = a + bi, consider the function 

g: (—1/2,1/2) — C defined by g(t) = (1 + t)*. Observe that the nth 
derivative of g is given by 

Equation: 


a(a —1)..(a—n+1) . 


(n) = 
ge (1+t)" * 


Then g € C®™ ((—1/2, 1/2)). (Of course, g is actually in C'™ (—1, 1), but 
the present theorem is only concerned with t's in (—1/2, 1/2).) 


For each nonnegative integer k define 
Equation: 


a, = 9) (0)/k! = a(a — HteTk + = a 


and set h equal to the power series function given by h(t) = >, axt*. 
According to part (d) of the preceding exercise, the radius of convergence 
for the power series). a,t* is 1. The aim of this theorem is to show that 
g(t) = h(t) for all —1/2 < t < 1/2. In other words, we wish to show that 
g agrees with this power series function at least on the interval 

(—1/2, 1/2). It will suffice to show that the sequence {S,,} of partial sums 
of the power series function h converges to the function g, at least on 
(—1/2, 1/2). We note also that the nth partial sum of this power series is 
just the nth Taylor polynomial T; for g. 

Equation: 


Q 2. g*) (0 
S, (t) = GY — > oO 


Now, fix a t strictly between —1/2 and 1/2, and let r < 1 be as in part (c) 
of [link]. That is, |t/(1 + y)| < r for every y between 0 and t. (This is an 
important inequality for our proof, and this is one place where the 
hypothesis that ¢ € (—1/2, 1/2) is necessary.) Note also that, for any 

y € (—1/2, 1/2), we have |(1 + y)°|= (1 + y)”, and this is trapped 
between (1/2)* and (3/2)*. Hence, there exists a number M such that 
(1+ y)“|< M for all y € (—1/2, 1/2). 


Next, choose an € > 0 for which G = (1+ €)r < 1. We let C, bea 
constant satisfying the inequality in Part (c) of [link]. So, using Taylor's 
Remainder Theorem, we have that there exists a y between 0 and ¢ for 
which 

Equation: 


|g (t)— S- ant” 
k=0 


oY) pnt 
(n+ 1)! 
a(a —1)...(a—n) 


(n+1)(1+y)"* 


(3 jlet aye 


nt+1 


nt+1 


n+1 


t 
C. tee Vy — 
( ) 1l+y 


OM fi. e)"* Mr" 
C.Mp"™"", ' 


IA 


IN IA 


Taking the limit as n tends to oo, and recalling that 6 < 1, shows that 
g(t) = h(t) for all —1/2 < t < 1/2, which completes the proof. 


More on Partial Derivatives 

We close the chapter with a little more concerning partial derivatives. Thus far, we have 
discussed functions of a single variable, either real or complex. However, it is difficult not to 
think of a function of one complex variable z = x + zy as equally well being a function of the 
two real variables x and y. We will write (a, b) and a + bi to mean the same point in C = R?, 
and we will write |(a, b)| and |a + bi| to indicate the same quantity, i.e., the absolute value of 
the complex number a + bi = (a, b). We have seen before that the only real-valued, 
differentiable functions of a complex variable are the constant functions. However, this is far 
from the case if we consider real-valued functions of two real variables, as is indicated 
previously. Consequently, we make the following definition of differentiability of a real-valued 
function of two real variables. Note that it is clearly different from the definition of 
differentiability of a function of a single complex variable, and though the various notations for 
these two kinds of differentiability are clearly ambiguous, we will leave it to the context to 
indicate which kind we are using. 


We close the chapter with a little more concerning partial derivatives. Thus far, we have 
discussed functions of a single variable, either real or complex. However, it is difficult not to 
think of a function of one complex variable z = x + zy as equally well being a function of the 
two real variables x and y. We will write (a,b) and a + bi to mean the same point in C = R?, 
and we will write |(a, b)| and |a + b2| to indicate the same quantity, i.e., the absolute value of 
the complex number a + bi = (a, b). We have seen in [link] that the only real-valued, 
differentiable functions of a complex variable are the constant functions. However, this is far 
from the case if we consider real-valued functions of two real variables, as is indicated in [link]. 
Consequently, we make the following definition of differentiability of a real-valued function of 
two real variables. Note that it is clearly different from the definition of differentiability of a 
function of a single complex variable, and though the various notations for these two kinds of 
differentiability are clearly ambiguous, we will leave it to the context to indicate which kind we 
are using. 


Let f : S + Rbea function whose domain is a subset S of R?, and let c = (a,b) bea 
point in the interior S° of '. We say that f is differentiable, as a function of two real 
variables, at the point (a, b) if there exists a pair of real numbers Lj and Lz and a function 
8 such that 

Equation: 


f (a + hy, b + hg) a= f (a, b) = Lyhy, + Laho + 0 (hi, hg) 


and 
Equation: 


(ha, he) 


im 
|(hi,h2)|0 |(hy, h2)| 


One should compare this definition with part (3) of [link]. 


Each partial derivative of a function f is again a real-valued function of two real variables, 
and so it can have partial derivatives of its own. We use simplifying notation like fry. and 
fuyyzyy... to indicate “higher order” mixed partial derivatives. For instance, f;2y. denotes 
the fourth partial derivative of f, first with respect to x, second with respect to x again, 
third with respect to y, and finally fourth with respect to x. These higher order partial 
derivatives are called mixed partial derivatives. 


Suppose S$ is a subset of R?, and that f is a continuous real-valued function on S. If both 
partial derivatives of f exist at each point of the interior S° of S, and both are continuous 
on S°, then f is said to belong to C1 (S). If all kth order mixed partial derivatives exist at 
each point of S°, and all of them are continuous on S$°, then f is said to belong to C* (iS). 
Finally, if all mixed partial derivatives, of arbitrary orders, exist and are continuous on S°, 


then f is said to belong to C'™ (S). 
Exercise: 
Problem: 
a. Suppose f is a real-valued function of two real variables and that it is differentiable, as 


a function of two real variables, at the point (a, b). Show that the numbers L; and Ly 
in the definition are exactly the partial derivatives of f at (a, b). That is, 


Equation: 
= ne ) =I f(a +h, b) — f(a, b) 
1 tale 7? To h 
and 
Equation: 
_ ually jin f(a,b+h) — f(a,b) 
> ‘tialy ea 7) h 


b. Define f on R? as follows: f(0,0) = 0, and if (x, y) 4 (0, 0), then 
f (é,9) =2y/ (x? + y’). Show that both partial derivatives of f at (0,0) exist and 
are 0. Show also that f is not, as a function of two real variables, differentiable at 
(0,0). HINT: Let A and & run through the numbers 1/n. 

. What do parts (a) and (b) tell about the relationship between a function of two real 
variables being differentiable at a point (a, b) and its having both partial derivatives 
exist at (a, b)? 

d. Suppose f = u + iv is a complex-valued function of a complex variable, and assume 

that f is differentiable, as a function of a complex variable, at a point 

c=a+ bi = (a,b). Prove that the real and imaginary parts u and v of f are 
differentiable, as functions of two real variables. Relate the five quantities 
Equation: 


a 


tialu tialu tialv tialv F 
Hale b), tialy (a, b), Hale 6% b), faly b), and f (ey 


Perhaps the most interesting theorem about partial derivatives is the “mixed partials are equal” 
theorem. That is, fry = fy. The point is that this is not always the case. An extra hypothesis is 
necessary. 

Theorem on mixed partials 


Let f : S — R be such that both second order partials derivatives f,, and fy, exist at a point 
(a, b) of the interior of S', and assume in addition that one of these second order partials exists at 
every point in a disk B, (a, 6) around (a, 6) and that it is continuous at the point (a, b). Then 


try (a, b) = Fyx (a, b). 


Suppose that it is fy. that is continuous at (a, 6). Let € > 0 be given, and let 6; > 0 be such that 
if |(c,d) — (a, b)| < 6; then | fyx (c,d) — fye (a, b)| < €. Next, choose a 2 such that if 

0 < |k| < do, then 

Equation: 


fry (a, 6) — BOOP A FO) é, 


and fix such a k. We may also assume that |k| < 61/2. Finally, choose a 63 > 0 such that if 
0 < |h| < 43, then 


Equation: 
h,b+k)—f(a,b+k 
fr(a,b +k) — Rosner i Meer) |. lle, 
and 
Equation: 
f(a+h,b) — f(a,b 
fz(a,6) — REPRO KOO ae 


and fix such an h. Again, we may also assume that |h| < 6/2. 


In the following calculation we will use the Mean Value Theorem twice. 
Equation: 


0 < |fey(a,b) — fe (a, b)| 
< | fey (a,b) ee 
2(a,b+k) — fz (a,b 
pene eG) 
f(ath,b+k)—f(a,b+k) 
Cd rma nes 
k 
) florh?)- flat) fe (a, b) 
k 
h,b +k) — fla,b+k h, b) — f(a,b 
PACES at eer Ana Ca ea ey 
= fla+hb+h)— flab ad t (f(a +h, b) — f(a, 6)) fre (a, b)| 
= 3p) EEN = flO Oa 
= 3e+|fye (a',") — fye (a, 6)| 
< 4e, 


because 0’ is between b and b + k, and a’ is between a and a + h, so that 

\(a’, b') — (a, b)|< 51/2 < 6;. Hence, | fry (a,b) — fye (a, b) < 4e, for an arbitrary €, and so 
the theorem is proved. 

Exercise: 


Problem: 
Let f be defined on R? by f(0,0) = 0 and, for (x,y) (0,0), f (a, y) = x°y/ (a? + y’). 


a. Prove that both partial derivatives f, and f, exist at each point in the plane. 
b. Show that fyz (0,0) = 1 and fry (0,0) = 0. 
c. Show that f,,., exists at each point in the plane, but that it is not continuous at (0, 0). 


The following exercise is an obvious generalization of the First Derivative Test for Extreme 
Values, [link], to real-valued functions of two real variables. 
Exercise: 


Problem: 
Let f : S + Rbea real-valued function of two real variables, and let c = (a, b) € S° bea 


point at which f attains a local maximum or a local minimum. Show that if either of the 
partial derivatives tial f /tialz or tial f /tialy exists at c, then it must be equal to 0. 


HINT: Just consider real-valued functions of a real variable like « — f(x, b) or 
y — f(a, y), and use [link]. 


Whenever we make a new definition about functions, the question arises of how the definition 
fits with algebraic combinations of functions and how it fits with the operation of composition. 
In that light, the next theorem is an expected one. 


(Chain Rule again) Suppose S is a subset of R?, that (a,b) is a point in the interior of S, and 
that f : S — Risa real-valued function that is differentiable, as a function of two real variables, 
at the point (a, b). Suppose that T is a subset of R, that c belongs to the interior of T’, and that 

y : T —+ R’ is differentiable at the point c and y(c) = (a,b). Write p(t) = (x(t), y(t)). Then 
the composition f o y is differentiable at c and 

Equation: 


_ tialf 


fow'(c) = = (a,b)a!(c) + 


tial f 
tialy 


_ tialf 
— tialx 


tialf 
tialy 


(a, b)y'(c) (2 (c))a"(e) + (2 (c))y! (c). 


From the definition of differentiability of a real-valued function of two real variables, write 
Equation: 


f(at+hi,b+ he) — f (a,b) = Lyhi + Lohe + 07 (Ai, he). 


and from part (3) of [link], write 
Equation: 


y(c+h)—v(c) =¢' (ch + 4 (h), 


or, in component form, 
Equation: 


a(c+h)—a(c)=a2(c+h)—a=2' (c)h+ 6, (h) 


and 
Equation: 


y(ct+h)—y(c) =y(e+h) -b=y (cht 6y(h). 


We also have that 
Equation: 


O¢ (Ai, ha)) 


lim, 2 
I(hi,ha)|30 | (hy, h2)| 


Equation: 


he 
and 
Equation: 
6, (h 
lim y(h) 0. 
ho0 6h 


We will show that f o y is differentiable at c by showing that there exists a number L and a 
function @ satisfying the two conditions of part (3) of [link]. 


Define 
Equation: 


ky (h), ko (h)) = ple +h) — ve) = (a(e+h)—2(c),y(e+h) —y(e)). 


Thus, we have that 
Equation: 


fogle+h)—fopl(c) = fly(e+h)) — fl(y(e)) 


= flxle+h),y(e+h)) — fla(c), y(e)) 

= fla+ ki (h), b+ ko (h ) — f (a,) 

= [yk (h) + Lok (h) + 0¢ (ki (A), ke (h)) 

= h(a(ct+h)—a(c)) + La(y(e+h) — y(c)) 
+04 (ki (h), ke (h)) 

= Iy(za'(c)h + Oz (h)) + Le (y (c)h + 6, (h)) 
+07 (Ka (h), ke (h)) 


= (Ly2'(c) + Loy (c))h 
+1192 (h) + Loy (h) + Of (hi (h), he (A). 


We define L = (Ly2’ (c) + Loy’ (c)) and 6 (h) = 110, (h) + L268, (h) + Of (ki (h), ko (h)). By 
these definitions and the calculation above we have Equation (4.1) 
Equation: 


fog(et+h) — fog(c) = Lh+ Oh), 


so that it only remains to verify [link] for the function 0. We have seen above that the first two 
parts of 0 satisfy the desired limit condition, so that it is just the third part of @ that requires some 


proof. The required argument is analogous to the last part of the proof of the Chain Rule ((Link]), 
and we leave it as an exercise. 
Exercise: 


Problem: 


a. Finish the proof to the preceding theorem by showing that 
Equation: 


HINT: Review the corresponding part of the proof to [link]. 

b. Suppose f : S — R is as in the preceding theorem and that y is a real-valued function 
of a real variable. Suppose f is differentiable, as a function of two real variables, at the 
point (a, b), and that ¢ is differentiable at the point c = f(a, b). Let g = yo f. Finda 
formula for the partial derivatives of the real-valued function g of two real variables. 

. (A generalized Mean Value Theorem) Suppose uw is a real-valued function of two real 
variables, both of whose partial derivatives exist at each point in a disk B, (a, 6). 
Show that, for any two points (a, y) and (z’, y’) in B, (a, b), there exists a point 
(2, 9) on the line segment joining (a, y) to (a’, y’) such that 
Equation: 


tialu ,. . tialu _... 
u(a,y)—u(a’,y) = an (2,9) (w@ —2') + ar (2,5) (y—y). 


io) 


HINT: Let y : [0,1] + R? be defined by y (t) = (1 — t) (2’, y’) + t (2, y). Now use 
the preceding theorem. 

d. Verify that the assignment f — tial f /tialz is linear; i.e., that 
Equation: 


tial(f+g)  tialf _ tialg 
tialx ~ tialx tialx 


Check that the same is true for partial derivatives with respect to y. 


Integration, Average Behavior A=n r/\2 
A brief introduction to a group of modules about integration, with links to 
important theorems in those modules. 


In this chapter we will derive the formula for the area of a circle 
of radius Asa matter of fact, we will first have to settle on exactly what is 
the definition of the area of a region in the plane, and more subtle than that, 
we must decide what kinds of regions in the plane “have” areas. Before we 
consider the problem of area, we will develop the notion of the integral (or 
average value) of a function defined on an interval which notion we 
will use later to compute areas, once they have been defined. 


The main results of this chapter include: 


1. The definition of integrability of a function, and the definition of the 
integral of an integrable function, 

2. The Fundamental Theorem of Calculus ((link)), 

3. The Integral Form of Taylor's Remainder Theorem ((link]), 

4. The General Binomial Theorem ((link]), 

5. The definition of the area of a geometric set, 

6. ([link]), and 

7. The Integral Test ((link]). 


Integrals of Step Functions 

We begin by defining the integral of certain (but not all) bounded, real- 
valued functions whose domains are closed bounded intervals. Later, we will 
extend the definition of integral to certain kinds of unbounded complex- 
valued functions whose domains are still intervals, but which need not be 
either closed or bounded. 


We begin by defining the integral of certain (but not all) bounded, real- 
valued functions whose domains are closed bounded intervals. Later, we will 
extend the definition of integral to certain kinds of unbounded complex- 
valued functions whose domains are still intervals, but which need not be 
either closed or bounded. First, we recall from [link] the following 
definitions. 


Let [a, b] be a closed bounded interval of real numbers. By a partition of 
[a, b] we mean a finite set P = {1p < 21 <... < tn} of n+ 1 points, 
where 2p = aand az, = DB. 


The n intervals {{x;—1, x;|} are called the closed subintervals of the 
partition P, and the n intervals {(a;_1, x;)} are called the open 
subintervals or elements of P. 


We write || P || for the maximum of the numbers (lengths of the 
subintervals) {2; — x;_,}, and call || P || the mesh size of the partition 
Py 


If a partition P = {x;} is contained in another partition Q = {y;}, i-e., 
each x; equals some y;, then we say that Q is finer than P. 


Let f be a function on an interval [a, b], and let P = {1 < ... < xy} 
be a partition of [a, b]. Physicists often consider sums of the form 
Equation: 


Spty3 = >, F (ys) (ei — 2-1), 
i=1 


where y; is a point in the subinterval (a;_1, 2;). These sums (called 
Riemann sums) are approximations of physical quantities, and the limit 
of these sums, as the mesh of the partition becomes smaller and smaller, 
should represent a precise value of the physical quantity. What precisely 
is meant by the “ limit” of such sums is already a subtle question, but 
even having decided on what that definition should be, it is as important 
and difficult to determine whether or not such a limit exists for many (or 
even any) functions f. We approach this question from a slightly 
different point of view, but we will revisit Riemann sums in the end. 


Again we recall from [link] the following. 


Let [a, b] be a closed bounded interval in R. A real-valued function 

h: [a,b] > R is called a step function if there exists a partition 

P= {xo < 41 <... < £2} of [a,b] such that for each 1 < i < n there 
exists a number a; such that h (2) = a; for all x € (x;_1, x;). 


REMARK A step function hf is constant on the open subintervals (or 
elements) of a certain partition. Of course, the partition is not unique. Indeed, 
if P is such a partition, we may add more points to it, making a larger 
partition having more subintervals, and the function A will still be constant 
on these new open subintervals. That is, a given step function can be 
described using various distinct partitions. 


Also, the values of a step function at the partition points themselves is 
irrelevant. We only require that it be constant on the open subintervals. 
Exercise: 


Problem: 


Let h be a step function on [a, b], and let P = {ap < 41 <... < @y} 
be a partition of [a, b] such that h (2) = a; on the subinterval (a;_1, x;) 
determined by P. 


a. Prove that the range of A is a finite set. What is an upper bound on 
the cardinality of this range? 

b. Prove that / is differentiable at all but a finite number of points in 
[a, b]. What is the value of h’ at such a point? 


Bh 


. Let f be a function on [a, b]. Prove that f is a step function if and 


only if f’ (a) exists and = 0 for every x € (a, b) except possibly 
for a finite number of points. 


. What can be said about the values of h at the endpoints {x;} of the 


subintervals of P? 


. (e) Let h be a step function on [a, b], and let 7 be a function on 


[a, b] for which h(a) = j(x) for all 2 € |a, b] except for one point 
c. Show that 7 is also a step function. 

If k is a function on [a, b] that agrees with a step function h except 
at a finite number of points cy, C2, ..., cn, show that k is also a step 
function. 


Exercise: 


Problem: 


Let [a, b] be a fixed closed bounded interval in R, and let H((a, 6) 
denote the set of all step functions on [a, b]. 


a. 


Cc. 


d. 


Using Part (c) of [link], prove that the set H({a, b]) is a vector 
space of functions; i.e., it is closed under addition and scalar 
multiplication. 


. Show that H({a, b]) is closed under multiplication; i.e., if 


hi, he € H ([a, b]), then hih2 € A ({a, d)). 

Show that H({a, b)) is closed under taking maximum and 
minimum and that it contains all the real-valued constant functions. 
We call a function y an indicator function if it equals 1 on an 
interval (c, d) and is 0 outside [c, d]. To be precise, we will denote 
this indicator function by x(.,g). Prove that every indicator function 
is a step function, and show also that every step function h is a 
linear combination of indicator functions: 

Equation: 


h= > DiX (c;,ds)° 
j=l 


e. Define a function k on (0, 1] by setting k(x) = O if z is a rational 
number and k(a) = 1 if z is an irrational number. Prove that the 
range of k is a finite set, but that k is not a step function. 


Our first theorem in this chapter is a fundamental consistency result about 
the “area under the graph” of a step function. Of course, the graph of a step 
function looks like a collection of horizontal line segments, and the region 
under this graph is just a collection of rectangles. Actually, in this remark, we 
are implicitly thinking that the values {a;} of the step function are positive. 
If some of these values are negative, then we must re-think what we mean by 
the area under the graph. We first introduce the following bit of notation. 


Let h be a step function on the closed interval [a, b]. Suppose 

P= {ao < 41 <... < &p} is a partition of [a, b] such that h (x) = a; 
on the interval (x;-1, 2;). Define the weighted average of hrelative toP 
to be the number S'p (h) defined by 

Equation: 


Sp (h) = Ya (x; = ea): 


REMARK Notice the similarity between the formula for a weighted average 
and the formula for a Riemann sum. Note also that if the interval is a single 
point, i.e., a = 6, then the only partition P of the interval consists of the 
single point x9 = a, and every weighted average Sp (h) = 0. 


The next theorem is not a surprise, although its proof takes some careful 
thinking. It is simply the assertion that the weighted averages are 
independent of the choice of partition. 


Let h be a step function on the closed interval [a, b]. Suppose 

P= {ap < 41 <... < &p} isa partition of [a, b] such that h (x) = a; on 
the interval (x;_1,2;), and suppose Q = {yo < y1 < ... < Ym} is another 
partition of [a, b] such that h (x) = 6; on the interval (y;_1, y;). Then the 


weighted average of h relative to P is the same as the weighted average of h 
relative to Q. That is, Sp (h) = Sg (A). 


Suppose first that the partition Q is obtained from the partition P by adding 
one additional point. Then m = n + 1, and there exists an 79 between 1 and 
nm — 1 such that 


1. for0 <7 < ig we have y; = 2;. 
2 Di SY ead 
3. For 79 < 4 < n we have x; = yj41- 


In other words, y;,+1 is the only point of @ that is not a point of P, and y;,+41 
lies strictly between x;, and z;,+1. 


Because h is constant on the interval (x;,, 2iji1) = (Yio. Yin+2), it follows 
that 


1. For 1 = 1 = 10,0; = b;. 
2. Big +t A bin +2 = Qin+l: 
3. Fori9 +1 <2 < n,a; = 6j41. 


So, 
Equation: 


= », b; (yi =: Vi=1) + Qiy+] (Yin +2 es Yin) 
i=l 


n 
si S> bisa (Yisa — Yi) 
10 


= S b; (yi A Yi-1) + Qigti (Yin+2 — Yight 1 Yigt1 — Yin) 


i=l 
n+l 
+ Sb: (yi — H-1) 
1=19+3 
to 
= Ns bi (Ys — Ya-1) + Digta (Yigta — Yio) + Big +2 (Yio+2 — Yio+1) 
i=l 
ag S° b: (yi — Ysa) 
= S° b: (yi — Yi) 
i=l 
= Sq (h), 


which proves the theorem in this special case where Q is obtained from P by 
adding just one more point. 


It follows easily now by induction that if Q is obtained from P by adding 
any finite number of additional points, then h is constant on each of the open 


subintervals determined by Q, and Sg (h) = Sp (h). 


Finally, let Q = {yo < y1 < ... < Ym} be an arbitrary partition of [a, b], for 
which A is constant on each of the open subintervals (y;_1, y;) determined 
by Q. Define R to be the partition of |a, b] obtained by taking the union of 
the partition points {x;} and {y;}. Then R is a partition of |a, b] that is 
obtained by adding a finite number of points to the partition P, whence 

Sr (h) = Sp(h). Likewise, R is obtained from the partition Q by adding a 
finite number of points, whence S'z (h) = Sq (h), and this proves that 

Sg (h) = Sp(h), as desired. 


Let [a, b] be a fixed closed bounded interval in R. We define the 
integral of a step function h on [a, 6], and denote it by [ h, as follows: 
If P = {x9 < 2 <... < @,} is a partition of [a, b], for which 

h(a) = a; for all x € (x;_1, 2;), then 

Equation: 


fo — 20 ya a, 


REMARK The integral of a step function h is defined to be the weighted 
average of h relative to a partition P of [a, b]. Notice that the preceding 
theorem is crucial in order that this definition of f h be unambiguously 
defined. The integral of a step function should not depend on which partition 
is used. [link] asserts precisely this fact. 


Note also that if the interval is a single point, i.e., a = b, then the integral of 
every step function h is 0. 


We use a variety of notations for the integral of h : 


Equation: 
b b 
[r=] n= | h(t) dt. 


The following exercise provides a very useful way of describing the integral 
of a step function. Not only does it show that the integral of a step function 
looks like a Riemann sum, but it provides a description of the integral that 
makes certain calculations easier. See, for example, the proof of the next 
theorem. 

Exercise: 


Problem: 


Suppose A is a step function on |a, b] and that 
R= {z < 21 <... < Zn} isa partition of |a, b] for which h is 
constant on each subinterval (z;_1, z;) of R. 


a. Prove that 
Equation: 


fr Son) = a (2; — 25-1); 


i=1 


where, for each 1 < i < n,w, is any point in (z;_1, z;). (Note then 
that the integral of a step function takes the form of a Riemann 
sum.) 

b. Show that fh is independent of the values of h at the points {z;} 
of the partition R. 


Exercise: 


Problem: Let h, and hz be two step functions on [a, 5]. 


a. Suppose that hy (x) = ho (x) for all x € |a, b] except for one 
point c. Prove that [h, = f ha. HINT: Let P be a partition of 
[a, b], for which both hy and hg are constant on its open 
subintervals, and for which c is one of the points of P. Now use 
the preceding exercise to calculate the two integrals. 

b. Suppose hy (x) = hg (2) for all but a finite number of points 
C1,...,en € [a,b]. Prove that [hi = f he. 


We have used the terminology “weighted average” of a step function relative 
to a partition P. The next exercise shows how the integral of a step function 
can be related to an actual average value of the function. 

Exercise: 


Problem: 


Let h be a step function on the closed interval [a, b], and let 

P= {ap < 41 < ... < £,} be a partition of [a, b] for which 

h (a) = a; on the interval (x;_1, 2;). Let us think of the interval [a, 5] 
as an interval of time, and suppose that the function h assumes the value 
a; for the interval of time between x;_; and x;. Show that the average 
value A(h) taken on by A throughout the entire interval ({a, b]) of time 
is given by 

Equation: 


Let H((a, b]) denote the vector space of all step functions on the closed 
interval [a, b]. Then the assignment h + f[ h of H({a, b]) into R has the 
following properties: 


1. (Linearity) H({a, b]) is a vector space. Furthermore, 
f (hi + he) = fhit fhe, and fch =c fh for all 
hi, he, h © H ((a, bj), and for all real numbers c. 
2. If h = S37, @iX(c,,a;) is a linear combination of indicator functions 
(See part (d) of [link]), then fh = So7_, ai (di — c;). 
3. (Positivity) If h(x) > 0 for all x € [a, bj, then fh > 0. 
4. (Order-preserving) If h; and hg are step functions for which 
hy (x) < he (x) for all x € [a,b], then fhi < fhe. 


That H({a, b]) is a vector space was proved in part (a) of [link]. Suppose 
P= {ao < 41 <... < £p} is a partition of [a, b] such that h1 (a) is 
constant for all z € (x;-1,2;), and suppose Q = {yo < y1 < ... < Ym}isa 
partition of [a, b] such that hy (a) is constant for all x € (y;_1, y;). Let 


R= {xz < 21 <... < z,} be the partition of [a, b] obtained by taking the 
union of the z;'s and the y,'s. Then Ah; and he are both constant on each open 
subinterval of R, since each such subinterval is contained in some open 
subinterval of P and also is contained in some open subinterval of Q. 
Therefore, h; + hg is constant on each open subinterval of R. Now, using 
[link], we have that 

Equation: 


[ (hth) = So +h) we) a2) 


= >. hy (we) (24 — Ze-1) + S° he (wx) (Ze — Zr-1) 


k=1 k=1 
[or + [he 


This proves the first assertion of part (1). 


3 I 


Next, let P = {xp < 41 <... < &,} bea partition of |a, b] such that h(x) is 
constant on each open subinterval of P. Then ch(z) is constant on each open 
subinterval of P, and using [link] again, we have that 

Equation: 


/ (cn) = > hae) 


i=1 


= >» h (w;) (x; — t-1) 


= c fh, 


which completes the proof of the other half of part (1). 


To see part (2), we need only verify that f X(ci,d;) = i — Ci, for then part (2) 
will follow from part (1). But x(¢,4,) is just a step function determined by the 
four point partition {a, c;, d;, b} and values 0 on (a, c;) and (d;, b) and 1 on 
(c;, d;). Therefore, we have that f X(c,4,) = di — Ci- 


If h(a) > 0 for all x, and P = {xp < 21 < ... < 2} is as above, then 
Equation: 


pr = Yh (ws) (x; — 25-1) 2 0, 


and this proves part (3). 


Finally, suppose hy, (a) < he (x) for all x € [a, b]. By [link], we know that 
the function h3 = hg — hy isa step function on [a, b]. Also, hg (x) > 0 for 
all x € [a,b]. So, by part (3), f hs > 0. Then, by part (1), 


Equation: 
0< fhy= f (te—m) = fia fi 


which implies that f hi < f he, as desired. 
Exercise: 


Problem: 


a. Let h be the constant function c on [a, b]. Show that 
fh=c(b-a). 

b. Let a < c < d < bbe real numbers, and let h be the step function 
on [a, b] that equals r for c < x < dand 0 otherwise. Prove that 
[’h(t)dt=r(d—o). 

c. Let h be a step function on |a, b]. Prove that |h| is a step function, 
and that | {h| < f |h|. HINT: Note that 
—|h|(a) < h(a) < |h|(x). Now use the preceding theorem. 


d. Suppose h is a step function on [a, bj and that c is a constant for 
which |h(z)| < c for all z € [a, 6]. Prove that | fh| < c(b— a). 


Integrable Functions 

We now wish to extend the definition of the integral to a wider class of functions. This 
class will consist of those functions that are uniform limits of step functions. The 
requirement that these limits be uniform is crucial. Pointwise limits of step functions 
doesn't work, as we will see in the first exercise. The initial step in carrying out this 
generalization is the following. 


We now wish to extend the definition of the integral to a wider class of functions. This 
class will consist of those functions that are uniform limits of step functions. The 
requirement that these limits be uniform is crucial. Pointwise limits of step functions 
doesn't work, as we will see in [link] below. The initial step in carrying out this 
generalization is the following. 


Let [a, b] be a closed bounded interval, and let {h,,} be a sequence of step functions that 
converges uniformly to a function f on [a, b]. Then the sequence { J hn} is a convergent 
sequence of real numbers. 


We will show that { f hy} is a Cauchy sequence in R. Thus, given an € > 0, choose an NV 
such that for any n > N and any x € |{a, b], we have 
Equation: 


Then, for any m and n both > N and any z € [a,b], we have 


Equation: 
€ 
hn (@) — Rm (2)| SIltn (#) — f (@)| + |f (2) Am (@)| < 5 
Therefore, 
Equation: 
ft fmf] fe Aed|s fla tels f 5a ms 
b-—a 
as desired. 


The preceding theorem provides us with a perfectly good idea of how to define the integral 
of a function f that is the uniform limit of a sequence of step functions. However, we first 
need to establish another kind of consistency result. 


If {h,,} and {k,,} are two sequences of step functions on |a, b], each converging uniformly 


to the same function f, then 
tim fh, =tim f iy 


Equation: 

Given € > 0, choose NV so that ifn > N, then |h,, (x) — f (x)| < €/ (2 (b — a)) for all 
x € [a, 6], and such that | f (x)—k,p (x)| < €/ (2 (b — a)) for all x € [a, b]. Then, 

|hn (2) — kn (x)| < €/ (b— a) for all x € [a,b] ifn > N. So, 


Equation: 
€ 
Rn = kn << = = 
/ | i / b-—a 


ifn > N. Taking limits gives 
Equation: 


hn — kn 


tim f hy tim f k= €. 


Since this is true for arbitrary € > 0, it follows that lim fh, =lim f kp, as desired. 


Let |a, b] be a closed bounded interval of real numbers. A function f : [a,b] > R is 
called integrable on |a, b] if it is the uniform limit of a sequence {h,, } of step 
functions. 


Let I([a, b]) denote the set of all functions that are integrable on [a, b]. If 
f € I({a, 6]), define the integral of f, denoted f{ f, by 


Equation: 
/ f =lim / ins 


where {h,, } is some (any) sequence of step functions that converges uniformly to f 
on [a, 5]. 


As in the case of step functions, we use the following notations: 
Equation: 


[r-[r- frou 


REMARK Note that [link] is crucial in order that this definition be unambiguous. Indeed, 
we will see below that this critical consistency result is one place where uniform limits of 
step functions works while pointwise limits do not. See parts (c) and (d) of [link]. Note also 
that it follows from this definition that f° f = 0, because f” h = 0 for any step function. 
In fact, we will derive almost everything about the integral of a general integrable function 
from the corresponding results about the integral of a step function. No surprise. This is the 
essence of mathematical analysis, approximation. 

Exercise: 


Problem: 


Define a function f on the closed interval [0, 1] by f(x) = 1 if x is a rational number 
and f(x) = O if z is an irrational number. 


a. Suppose A is a step function on (0, 1]. Prove that there must exist an x € (0, 1] 
such that | f(a) — h(a)| > 1/2. HINT: Let (a2;~1, 2;) be an interval on which h 
is a constant c. Now use the fact that there are both rationals and irrationals in 
this interval. 

b. Prove that f is not the uniform limit of a sequence of step functions. That is, f is 

not an integrable function. 

. Consider the two sequences {h,, } and {k,,} of step functions defined on the 
interval [0, 1] by hn = x(0,1/n), and kn = NX (0,1/n). Show that both sequences 
{h,} and {k,,} converge pointwise to the 0 function on [0, 1]. HINT: All 
functions are 0 at z = 0. For z > 0, choose N so that 1/N < a. Then, for any 
tN hy (eo) ]ke(a) =O. 

d. Let h,, and k,, be as in part (c). Show that lim fh,, = 0, but lim fk, = 1. 

Conclude that the consistency result in [link] does not hold for pointwise limits 
of step functions. 


io) 


Exercise: 


Problem: Define a function f on the closed interval [0, 1] by f(x) = a. 


a. For each positive integer n, let P,, be the partition of [0, 1] given by the points 
{0 <1/n < 2/n < 3/n<... < (n—1)/n < 1}. Define a step function h,, on 
[0, 1] by setting hy (x) = i/nif + <2 < +, and hy (i/n) =i/n for all 
0 <i <n. Prove that | f (z)—h, (x)| < 1/n for all 2 € [0,1], and then 
conclude that f is the uniform limit of the h,'s whence f € I([0, 1]). 

b. Show that 


Equation: 


c. Show that fo f (t) dt = 1/2. The next exercise establishes some additional 
properties of integrable functions on an interval {a, b]. 


Exercise: 
Problem: 
Let |a, b] be a closed and bounded interval, and let f be an element of I(|a, }]). 


a. Show that, for each € > O there exists a step function h on [a, 6] such that 
| f(x) — h(x)| < € for all x € [a, B]. 


b. For each positive integer n let h,, be a step function satisfying the conclusion of 
part (a) for e = 1/n. Define k, = h, — 1/n andl, = hy, + 1/n. Show that ky, 
and l,, are step functions, that k, (x) < f (a) < I, (a) for all x € [a, b], and that 


ln (a) — ky (x)| = ln (2) — kp (x) = 2/n for all x. Hence, 
Jo (In = kn) = (0 ~ 0). 


‘2 


such that k(a) < f(x) < U(x) for which f (I(x) — k(z)) <e. 
d. Prove that there exists a sequence {7,,} of step functions on |a, b], for which 


Conclude from part (b) that, given any € > 0, there exist step functions k and I 


jn (£) < jn41 (2) < f (x) for all x, that converges uniformly to f. Show also 


that there exists a sequence {j,, } of step functions on [a, b], for which 
Jn (£) > Inia (2) > f (x) for all x, that converges uniformly to f. That is, if 
f € I((a, 6), then f is the uniform limit of a nondecreasing sequence of step 
functions and also is the uniform limit of a nonincreasing sequence of step 


functions. HINT: To construct the j,,'s and j,,'s, use the step functions k, and 1, 
of part (b), and recall that the maximum and minimum of step functions is again 


a step function. 
Show that if f(a) > 0 for all x € |a, bj, and g is defined by g(x) = \/ f(z), 

then g € I(|a, b]). HINT: Write f =lim h,, where h,, (x) > 0 for all x and n. 
Then use part (g) of Exercise 3.28. 


p 


eh 


that if Q = {xp < 41 <... < xy} is any partition finer than P, and {w;} are 
any points for which w; € (a;-1, 2;), then 
Equation: 


b n 
/ f (t) dt — dof (wi) (x; — j-1)|<e. 


. (Riemann sums again.) Show that, given an € > 0, there exists a partition P such 


HINT: Let P be a partition for which both the step functions k& and I of part (c) 
are constant on the open subintervals of P. Verify that for any finer partition Q, 
l(wi) > f (wi) > &(w;), and hence 

Equation: 


dE (wi) (te— a4) > df (wi) (te — a4) 2 dk (wi) (oy — a4). 


A bounded real-valued function f on a closed bounded interval a, 6] is called 
Riemann-integrable if, given any € > 0, there exist step functions & and J, on |a, b] for 
which k(x) < f(x) < U(a) for all x, such that f (J — k) < €. We denote the set of all 


functions on [a, b] that are Riemann-integrable by Ip ({a, }]). 


REMARK The notion of Riemann-integrability was introduced by Riemann in the mid 
nineteenth century and was the first formal definition of integrability. Since then several 
other definitions have been given for an integral, culminating in the theory of Lebesgue 
integration. The definition of integrability that we are using in this book is slightly different 
and less general from that of Riemann, and both of these are very different and less general 
from the definition given by Lebesgue in the early twentieth century. Part (c) of [link] 
above shows that the functions we are calling integrable are necessarily Riemann- 
integrable. We will see in [link] that there are Riemann-integrable functions that are not 
integrable in our sense. In both cases, Riemann's and ours, an integrable function f must be 
trapped between two step functions & and J. In our definition, we must have 

I(x) — k(x) < € for all x € [a, 6], while in Riemann's definition, we only need that 

f | —k < e. The distinction is that a small step function must have a small integral, but it 
isn't necessary for a step function to be (uniformly) small in order for it to have a small 
integral. It only has to be small on most of the interval [a, 6). 


On the other hand, all the definitions of integrability on [a, b] include among the integrable 
functions the continuous ones. And, all the different definitions of integral give the same 
value to a continuous function. The differences then in these definitions shows up at the 
point of saying exactly which functions are integrable. Perhaps the most enlightening thing 
to say in this connection is that it is impossible to make a “good” definition of integrability 
in such a way that every function is integrable. Subtle points in set theory arise in such 
attempts, and many fascinating and deep mathematical ideas have come from them. 
However, we will stick with our definition, since it is simpler than Riemann's and is 
completely sufficient for our purposes. 


Let |a, b] be a fixed closed and bounded interval, and let I([a, b]) denote the set of 
integrable functions on |a, b]. Then: 


1. Every element of I({a, b]) is a bounded function. That is, integrable functions are 
necessarily bounded functions. 

2. I({a, b]) is a vector space of functions. 

3. I({a, b]) is closed under multiplication; i.e., if f and g € I([a, b]), then 
fg € I([a, 6). 

4. Every step function is in I({a, 6). 

5. If f is a continuous real-valued function on [a, ], then f is in I([a, |). That is, every 
continuous real-valued function on [a, 6] is integrable on {a, b]. 


Let f € I({a, b]), and write f =lim h,,, where {h,,} is a sequence of step functions that 
converges uniformly to f. Given the positive number € = 1, choose N so that 

| f (2)—hy (x)| < 1 for all x € [a, b]. Then | f (x)| <|hy (x)| + 1 for all x € [a, b]. 
Because hy is a step function, its range is a finite set, so that there exists a number M for 
which |hy (x)| < M for all x € |[a, b]. Hence, | f(x)| < M+ 1 forall x € [a, }], and this 
proves part (1). 


Next, let f and g be integrable, and write f =lim h,, and g =lim k,,, where {h,,} and 
{ky} are sequences of step functions that converge uniformly to f and g respectively. If s 
and t¢ are real numbers, then the sequence {sh,, + tk,,} converges uniformly to the 
function sf + tg. See parts (c) and (d) of [link]. Therefore, sf + tg € I({a,b]), and 
I({a, b]) is a vector space, proving part (2). 


Note that part (3) does not follow immediately from [link]; the product of uniformly 
convergent sequences may not be uniformly convergent. To see it for this case, let 

f =lim A, and g =lim k,, be elements of I({a, b]). By part (1), both f and g are bounded, 
and we write M; and M, for numbers that satisfy | f (x)| < My and |g(x)| < M, for all 
x € [a,b]. Because the sequence {k,,} converges uniformly to g, there exists an N such 
that ifn > N we have |g (x)—k,, (x)| < 1 for all x € [a, b]. This implies that, ifn > N, 
then |k,, (z)| <M, + 1 for all x € [a, 5]. 


Now we show that fg is the uniform limit of the sequence h,,k,. For, ifn > N, then 
Equation: 


[f (2)9 (@)—hn (@)kn (@)| = |F (x)9 (@) — fe) Rn (@) + F(@) Rn () — hn (@) Rn (2)| 


( (x)k 
If (2)|lg (@) — Rn (2) |+] hn (@)I|F (2) — An (@)| 
Msg (#)—kn (&)|-+ (My + 1) |f (2) — hn (2)I, 
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which implies that fg =lim (hyk,). 


If h is itself a step function, then it is obviously the uniform limit of the constant sequence 
{h}, which implies that h is integrable. 


Finally, if f is continuous on [a, b], it follows from [link] that f is the uniform limit of a 
sequence of step functions, whence f € I(|a, }]). 
Exercise: 


Problem: 
Let f be the function defined on [0, 1] by f(z) =sin (1/2) if e 4 0 and f(0) = 0. 


a. Show that f is continuous at every nonzero x and discontinuous at 0. HINT: 
Observe that, on any interval (0, 6), the function sin (1/) attains both the 
values 1 and —1. 

b. Show that f is not integrable on |0, 1]. HINT: Suppose f =lim h,,. Choose N so 
that | f (z)—hy (x)| < 1/2 for all x € [0,1]. Let P be a partition for which hy 
is constant on its open subintervals, and examine the situation for z's in the 
interval (xo, 21). 

c. Show that fis Riemann-integrable on [0, 1]. Conclude that I([a, b]) is a proper 
subset of Ip (|a, b]). 


Exercise: 


Problem: 


a. Let f be an integrable function on [a, b|]. Suppose g is a function for which 
g(x) = f(x) for all x € |a, b] except for one point c. Prove that g is integrable 
and that fg = f f. HINT: If f =lim hy, define k, (x) = hy (x) for alla #c 
and ky, (c) = g(c). Then use [link]. 
b. Again, let f be an integrable function on [a, b|. Suppose g is a function for which 
g(x) = f(x) for all but a finite number of points cy, ..., cw € [a, b]. Prove that 
g € I({a, }]), and that fg = ff. 
. Suppose f is a function on the closed interval |a, 6], that is uniformly continuous 
on the open interval (a, b). Prove that f is integrable on |a, b]. HINT: Just 
reproduce the proof to [link]. 


io) 


REMARK In view of part (b) of the preceding exercise, we see that whether a function f 
is integrable or not is totally independent of the values of the function at a fixed finite set of 
points. Indeed, the function needn't even be defined at a fixed finite set of points, and still it 
can be integrable. This observation is helpful in many instances, e.g., in parts (d) and (e) of 
[link]. 


The assignment f — f f on I({a, 6]) satisfies the following properties. 


1. (Linearity) I({a, |) is a vector space, and f(af + 89) =af f+ fg forall 
fg € I (a, bland a, 8 € R. 


2. (Positivity) If f(x) > 0 for all x € [a,b], then f f > 0. 

3. (Order-preserving) If f, g € I([a, b]) and f(x) < g(x) for all x € [a, 6], then 
Jia g 

4. If f € I([a, b]), then so is |f|, and | ff] < ff]. 

5. If f is the uniform limit of functions f,,, each of which is in I({a, b]), then 
f € 1({a,6]) and f f =lim f fy. 

6. Let {un} be a sequence of functions in J([a, b]). Suppose that for each n there is a 
number m,,, for which |u, (x)| < mr» for all x € [a,b], and such that the infinite 
series }) m,, converges. Then the infinite series 5) u, converges uniformly to an 
integrable function, and fS* un = > fun. 


That I([a, b]) is a vector space was proved in part (2) of [link]. Let f and g be in J((a, 5), 


and write f =lim h, and g =lim k,, where the h,,'s and the k,,'s are step functions. Then 
af + Bg =lim (ah, + Bk,,), so that, by [link] and the definition of the integral, we have 


Equation: 
[ (as +88) = tim [ (a, + Bk) 


I| I| I| 
Q Q = 
Se EE 
+ SS, 
> > > 
=> + 3 
= r— + 
=r DR 
5 SS, 
—, = 
- 


which proves part (1). 


Next, if f € I(a, b]) satisfies f(x) > 0 for all x € |a, bj, let {1,,} be a nonincreasing 
sequence of step functions that converges uniformly to f. See part (d) of [link]. Then 
l, (x) > f (x) > 0 for all x and all n. So, again by [link], we have that 


Equation: 
[faim fan >o. 


This proves part (2). 
Part (3) now follows by combining parts (1) and (2) just as in the proof of [link]. 


To see part (4), let f € I([a, b]) be given. Write f =lim h,. Then | f| =lim|h,,|. For 
Equation: 


If ()[—[An (|| < [Ff (@)—An (2)]- 


Therefore, | f| is integrable. Also, 


Equation: 
[Is =lim [ >tim | fh, tim f hn|=| f #] 


To see part (5), let { f,, } be a sequence of elements of I([a, b]), and suppose that 
f =lim f,. For each n, let h,, be a step function on [a, 6] such that 
| fn (2) — An (x)| < 1/n for all x € [a, 6]. Note also that it follows from parts (3) and (4) 


that 
b— 
n 


Equation: 
Now {h,,} converges uniformly to f. For, 
Equation: 


hy, 


JF (2)—hn (2) <_F (@)—Fn (@)I+ LF (@) — An (2) 
< If @)-fr@)l+—, 


showing that f =lim h,,. Therefore, f € I([a,6]). Moreover, f f =lim fh. Finally, 


ff =lim f fh, for 
Equation: 

[t- fo] = fa- fl] foam fo 
[r- fialer—. 


\ 
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This completes the proof of part (5). 


Part (6) follows directly from part (5) and the Weierstrass M Test ([link]). For, part (1) of 
that theorem implies that the infinite series }* wu, converges uniformly, and then 
f Sun = f un follows from part (5) of this theorem. 


As a final extension of our notion of integral, we define the integral of certain complex- 
valued functions. 


Let [a, b| be a fixed bounded and closed interval. A complex-valued function 
f = u+ wis called integrable if its real and imaginary parts u and v are integrable. 


In this case, we define 
b b b b 
/ Pay (uti) = | u+if Vv. 
a a a a 


Equation: 
1. The set of all integrable complex-valued functions on |a, 6] is a vector space over the 
field of complex numbers, and 


Equation: 
[otro =0f se fs 


for all integrable complex-valued functions f and g and all complex numbers @ and £. 
2. If f is an integrable complex-valued function on [a, 6], then so is | f|, and 
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We leave the verification of part (1) to the exercise that follows. 


To see part (2), suppose that f is integrable, and write f = u + iv. Then |f| = Vu? 4 v?, 
so that | f| is integrable by [link] and part (e) of [link]. Now write z= ff, and write z in 
polar coordinates as z = re’’, where r = |z| =| ri f|. (See [link].) Define a function g by 
g(x) =e “f (a) and notice that |g| = |f|. Then Sg =e 8 i. f =7, which is a real 
number. Writing g = & + 70, we then have that r = fu -+i fd, implying that fo = 0. 
So, 

Equation: 


b 


< | fa 
a 
b 
< l9| 
a 
b 
= fl, 
a 
as desired. 
Exercise: 


Problem: Prove part (1) of the preceding theorem. 


HINT: Break a, 8, f f, and fg into real and imaginary parts. 


The Fundamental Theorem of Calculus 
This module covers two important theorems, including the fundamental 
theorem of calculus. 


We begin this section with a result that is certainly not a surprise, but we 
will need it at various places in later proofs, so it's good to state it precisely 
now. 


Suppose f € I([a, b]), and suppose a < c < b. Then f € I({a, c]), 


f € I((c,b]), and 
fr-for+ fit 


Equation: 

Suppose first that h is a step function on [a, bj, and let 

P= {ap < 21 <... < £,} bea partition of [a, b] such that h (2) = a; on 
the subinterval (x;_1, 2;) of P. Of course, we may assume without loss of 
generality that c is one of the points of P, say c = xx. Clearly h is a step 
function on both intervals [a, c] and |c, 6]. 


Now, let Q; = {a = %9 < 4 <... <c = xx} be the partition of [a, c| 
obtained by intersecting P with [a, c], and let 

Qo = {c= xR < Ley < ... < Ln = D} be the partition of [c, b| obtained 
by intersecting P with [c, b]. We have that 

Equation: 


n 


a; (x; = Li-1) = S a; (x; = Li-1) 


i=k+1 


»: 
= So, (h) + Sa, (h) 
fons fn 


which proves the theorem for step functions. 


Now, write f =lim h,,, where each h,, is a step function on [a, b]. Then 
clearly f =lim h,, on [a, c], which shows that f € I({a,c]), and 


Equation: 
i: f =lim : h 


Similarly, f =lim h,, on [c, b], showing that f € I([c, b|), and 


Equation: 
b b 
i f =lim / h 


Finally, 
Equation: 


I| 
ey 
5 
aN 
Q 
oO 
=> 
3 
+ 
lo) 
ao 
=> 
3 
NWS 


| 
5 
> 
= 
+ 
5 
> 
3 


\| 
2 
oy, 
++ 

~ 


as desired. 


I's time for the trumpets again! What we call the Fundamental Theorem of 
Calculus was discovered by Newton and Leibniz more or less 
simultaneously in the seventeenth century, and it is without doubt the 
comerstone of all we call mathematical analysis today. Perhaps the main 
theoretical consequence of this theorem is that it provides a procedure for 
inventing “new” functions. Polynomials are rather natural functions, power 
series are a simple generalization of polynomials, and then what? It all 
came down to thinking of a function of a variable z as being the area 
beneath a curve between a fixed point a and the varying point x. By now, 
we have polished and massaged these ideas into a careful, detailed 
development of the subject, which has substantially obscured the original 
ingenious insights of Newton and Leibniz. On the other hand, our 
development and proofs are complete, while theirs were based heavily on 
their intuition. So, here it is. 

Fundamental Theorem of Calculus 


Suppose f is an arbitrary element of I({a, b]). Define a function F’ on [a, 6] 
by F(x) = f’ f. Then: 


1. F is continuous on [a, b], and F(a) = 0. 
2. If f is continuous at a point c € (a, b), then F is differentiable at c and 


F'(c) = f(¢). 


3. Suppose that f is continuous on |a, b]. If G is any continuous function 
on [a, b] that is differentiable on (a, b) and satisfies G’ (x) = f (x) for 


all x € (a,b), then 
Equation: 


b 
i f(t) dt = G(b) — G (a). 


REMARK Part (2) of this theorem is the heart of it, the great discovery of 
Newton and Leibniz, although most beginning calculus students often think 
of part (3) as the main statement. Of course it is that third part that enables 


us to actually compute integrals. 


Because f € I([a, b]), we know that f € I(|a, x]) for every x € [a, b], so 


that F(a) at least is defined. 


Also, we know that f is bounded; i.e., there exists an M such that 
|f(t)| < M for all t € [a, b]. Then, if x, y € [a, b] with x > y, we have 


that 
Equation: 


|F'(x) — F(y)| 


IA 
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so that |F'(x) — F(y)| < M|a —y| < eif |x —y| < 6 =€/M. This 
shows that F’ is (uniformly) continuous on [a, b]. Obviously, 
F (a) = f” f =0, and part (1) is proved. 


Next, suppose that f is continuous at c € (a,b), and write L = f(c). Let 
€ > 0 be given. To show that F is differentiable at c and that 
F' (c) = f (c), we must find a 6 > 0 such that if 0 < |h| < 6 then 


Equation: 


wa ee 


Since f is continuous at c, choose 6 > 0 so that | f(t) — f(c)| < eif 
|t — c| < 6. Now, assuming that h > 0 for the moment, we have that 


Equation: 
cth Cc 
Fe+h)-F = f s- fF 
- an Cc 
-fs+for-fs 
cth 
— / J, 
and 
Equation: 
cth 
L 
t Tia Je 
h 


So, if0 < h < 6, then 
Equation: 


where the last inequality follows because for t € [c, c + h], we have that 
Jt — c| <A < 6.A similar argument holds if h < 0. (See the following 
exercise.) This proves part (2). 


Suppose finally that G is continuous on [a, 6], differentiable on (a, b), and 
that G’ (x) = f (x) for all 2 € (a, b). Then, F’ — G is continuous on |a, 5], 
differentiable on (a, b), and by part (2) 

(F — G)' (x) = F' (x) — G' (x) = f (x) — f (x) = 0 forall z € (a,b). 
It then follows from [link] that F’ — G is a constant function C’, whence, 
Equation: 


GW) -G(a) =F) +C-F@)-C=F w= f Fat 


and the theorem is proved. 
Exercise: 


Problem: 


a. Complete the proof of part (2) of the preceding theorem; i.e., take 
care of the case when h < 0. HINT: In this case,a< c+h<e. 


Then, write [" f = ee age eee 

b. Suppose f is a continuous function on the closed interval [a, b], 
and that f’ exists and is continuous on the open interval (a, 0). 
Assume further that f’ is integrable on the closed interval [a, b). 
Prove that f(z) — f(a) = f° f’ forall x € [a,b]. Be careful to 
understand how this is different from the Fundamental Theorem. 

c. Use the Fundamental Theorem to prove that for x > 1 we have 
Equation: 


cane 
ln (2) =F ee) =| — at, 
bo 
and for 0 < x < 1 we have 
Equation: 
ral 


In (2) = F(2) =~ [ = 


HINT: Show that these two functions have the same derivative 
and agree at x = 1. 


Consequences of the Fundamental Theorem 

The first two theorems of this section constitute the basic “techniques of 
integration” taught in a calculus course. However, the careful formulations 
of these standard methods of evaluating integrals have some subtle points, 
i.e., some hypotheses. Calculus students are rarely told about these details. 


The first two theorems of this section constitute the basic “techniques of 
integration” taught in a calculus course. However, the careful formulations 
of these standard methods of evaluating integrals have some subtle points, 
i.e., some hypotheses. Calculus students are rarely told about these details. 
Integration by Parts Formula 


Let f and g be integrable functions on [a, b], and as usual let F' and G 
denote the functions defined by 
Equation: 


Fa)= fF, td c(e)= fo. 


Then 
Equation: 


b b 
[ te-1FOCw-F@cta)- | Fs 


Or, recalling that f = F’ and g = G’, 
Equation: 


b b 
/ F'G =[F (6)G (b) — F(a)G (a)] — / FG’. 


Exercise: 


Problem: 


a. Prove the preceding theorem. HINT: Replace the upper limit b by 
a variable x, and differentiate both sides. By the way, how do we 
know that the functions F’g and fG are integrable? 

b. Suppose f and g are integrable functions on |a, b] and that both f’ 
and g’ are continuous on (a, b) and integrable on [a, |. (Of 
course f’ and g’ are not even defined at the endpoints a and b, but 
they can still be integrable on [a, b]. See the remark following 
[link].) Prove that 
Equation: 


b b 
/ fa’ = [F (0)9(b) — F(@g(a)] - i, Pe 


Integration by Substitution 


Let f be a continuous function on [a, 6], and suppose g is a continuous, one- 
to-one function from [c, d] onto [a, b] such that g is continuously 
differentiable on (c, d), and such that a = g(c) and b = g(d). Assume 
finally that g’ is integrable on [c, d]. Then 

Equation: 


b d 
/ f (t)dt = / f(g(s))9" (2) ds. 


It follows from our assumptions that the function f (g (s))g’ (s) is 
continuous on (a, b) and integrable on [c, dj. It also follows from our 
assumptions that g maps the open interval (c, d) onto the open interval 
(a, b). As usual, let F’ denote the function on [a, 6] defined by 

F (x) = [* f (€) dt. Then, by part (2) of the Fundamental Theorem, F is 
differentiable on (a,b), and F’’ = f. Then, by the chain rule, F'o gis 
continuous and differentiable on (c,d) and 

Equation: 


(F'og)' (s) = F’ (g(s))9' (s) = f (9(s))g' (s). 


So, by part (3) of the Fundamental Theorem, we have that 
Equation: 


d d 
/ f(g(s))g' (s)da = / (Fo g)' (s) ds 


| 
- 
——~ 
ce) 
—~ 
= 
| 
- 
——~ 
ce) 
—~ 
— 


| 
a 
— 
SS 
= 


which finishes the proof. 
Exercise: 


Problem: 


a. Prove the “Mean Value Theorem” for integrals: If f is continuous 
on |a, 6], then there exists a c € (a, b) such that 
Equation: 


b 
i: f(t) dt = f(e) (0— a). 


b. (Uniform limits of differentiable functions. Compare with [link].) 
Suppose { f,,} is a sequence of continuous functions on a closed 
interval [a, b] that converges pointwise to a function f. Suppose 
that each derivative f/ is continuous on the open interval (a, b), 
is integrable on the closed interval [a, b], and that the sequence 
{ f’ } converges uniformly to a function g on (a, b). Prove that f 
is differentiable on (a,b), and f’ = g. HINT: Let z be in (a, b), 
and let c be in the interval (a, x). Justify the following equalities, 
and use them together with the Fundamental Theorem to make 
the proof. 

Equation: 


f (2)  f(6) im (fn (2) = fa(o)) tim ff = fg 


We revisit now the Remainder Theorem of Taylor, which we first presented 
in [link]. The point is that there is another form of this theorem, the integral 
form, and this version is more powerful in some instances than the original 
one, e.g., in the general Binomial Theorem below. 

Integral Form of Taylor's Remainder Theorem 


Let c be a real number, and let f have n + 1 derivatives on (c — r,c + 1), 
and suppose that f("t)) € I ([e —r,c+r]). Then for eache <2 <c+r, 
Equation: 


f (x) — Ti, () = a ferry a) =)" dt, 


n! 


where 7’; denotes the nth Taylor polynomial for f. 


Similarly, forc —r <a <ce, 
Equation: 


F(e)— Thy (2) = ff Qo at 


n! 


Exercise: 


Problem: Prove the preceding theorem. 


HINT: Argue by induction on n, and integrate by parts. 


REMARK We return now to the general Binomial Theorem, first studied in 
[link]. The proof given there used the derivative form of Taylor's remainder 
Theorem, but we were only able to prove the Binomial Theorem for 


|t| < 1/2. The theorem below uses the integral form of Taylor's Remainder 
Theorem in its proof, and it gives the full binomial theorem, i.e., for all ¢ for 
which |t| < 1. 

General Binomial Theorem 


Let a = a + bi be a fixed complex number. Then 
Equation: 


for allt € (—1,1). 


For clarity, we repeat some of the proof of [link]. Given a general 
a = a+ bi, consider the function g : (—1,1) — C defined by 
g(t) = (1+t)®. Observe that the nth derivative of g is given by 


Equation: 
e a(a—1)...(a—-n+1 
g' ) (t) = ce ld 
(1+¢) 


Then g € C™ ((—1,1)). 


For each nonnegative integer k define 
Equation: 


ind OE a(a — HoT kt) _ (°), 


and set h(t) = 07°, axt®. The radius of convergence for the power series 
function h is 1, as was shown in [link]. We wish to show that g(t) = h(t) 
for all —1 < t < 1. That is, we wish to show that g is a Taylor series 
function around 0. It will suffice to show that the sequence {S,,} of partial 


sums of the power series function h converges to the function g. We note 
also that the nth partial sum is just the nth Taylor polynomial T° for g. 


Now, fix a ¢ strictly between 0 and 1. The argument for t's between —1 and 
0 is completely analogous.. Choose an € > 0 for which 6 = (1+ e)t < 1. 
We let C, be a numbers such that | (*) |< C.(1 + €)” for all nonnegative 
integers n. See [link]. We will also need the following estimate, which can 
be easily deduced as a calculus exercise (See part (d) of [link].). For all s 
between 0 and t, we have (¢ — s)/(1 +s) < t. Note also that, for any 

s € (0, t), we have |(1 + s)“|= (1 +s)", and this is trapped between 1 
and (1 + t)*. Hence, there exists a number M; such that 

(1+ s)* "|< M; for all s € (—0, t). We will need this estimate in the 
calculation that follows. 


Then, by the integral form of Taylor's Remainder Theorem, we have: 
Equation: 


|g (t)— oe a,t* 
k=0 


foe al 
[( ee “Ja jg yet 8) dsl 


n+1 


< f'\(,2,)il@+artin+ mss] ae 
< [ (2) M, (n+ 1)t” ds 
< 


t 
C.-M; (n+ 1) / (Pe) da 
0 


C.M,(n+1)(1+6)" te" 
CM: (n te De 


which tends to 0 as n goes to oo, because G < 1. This completes the proof 
tor 0 i= 1. 


Area of Regions in the Plane 
This module contains an axiom of choice and also includes various theorems and 
exercises related to the usage of integrals in finding the area of regions in the plane. 


It would be desirable to be able to assign to each subset S$ of the Cartesian plane R? a 
nonnegative real number A(S) called its area. We would insist based on our intuition 
that (i) if S is a rectangle with sides of length Z and W then the number A(S) should 
be LW, so that this abstract notion of area would generalize our intuitively 
fundamental one. We would also insist that (ii) if S were the union of two disjoint parts, 
S = S; U S9, then A(S) should be A (.S;) + A ($2). (We were taught in high school 
plane geometry that the whole is the sum of its parts.) In fact, even if S were the union 
of an infinite number of disjoint parts, S = U°°_, S, with $; S; = Wifi 4 j, we 
would insist that (iii) A (S) = S>°°, A(S,). 


The search for such a definition of area for every subset of R? motivated much of 
modern mathematics. Whether or not such an assignment exists is intimately related to 
subtle questions in basic set theory, e.g., the Axiom of Choice and the Continuum 
Hypothesis. Most mathematical analysts assume that the Axiom of Choice holds, and as 
a result of that assumption, it has been shown that there can be no assignment 

S — A(S) satisfying the above three requirements. Conversely, if one does not assume 
that the Axiom of Choice holds, then it has also been shown that it is perfectly 
consistent to assume as a basic axiom that such an assignment S — A(S) does exist. 
We will not pursue these subtle points here, leaving them to a course in Set Theory or 
Measure Theory. However, Here's a statement of the Axiom of Choice, and we invite 
the reader to think about how reasonable it seems. 


AXIOM OF CHOICE Let 7 be a collection of sets. Then there exists a set A that 
contains exactly one element out of each of the sets S$ in. 


The difficulty mathematicians encountered in trying to define area turned out to be 
involved with defining A(S) for every subset S € R?. To avoid this difficulty, we will 
restrict our attention here to certain “ reasonable” subsets S. Of course, we certainly 
want these sets to include the rectangles and all other common geometric sets. 


By a (open) rectangle we will mean a set R = (a,b) x (c,d) in R?. That is, 
R={(a,y):a<a2<bandc< y< d}. The analogous definition of a closed 
rectangle|a, b] x |c, d] should be clear: 

[a,b] x [c,d] = {(a,y):a<acbc<yK< dh. 


By the area of a (open or closed) rectangle R = (a,b) x (c,d) or [a,b] x [c, d] 
we mean the number A(R) = (b—a)(d—c).. 


The fundamental notion behind our definition of the area of a set S is this. If an open 
rectangle R = (a,b) x (c,d) is a subset of S, then the area A(S) surely should be 
greater than or equal to A(R) = (b — a)(d — c). And, if S contains the disjoint union 
of several open rectangles, then the area of S should be greater than or equal to the sum 
of their areas. 


We now specify precisely for which sets we will define the area. Let [a, b] be a fixed 
closed bounded interval in R and let / and wu be two continuous real-valued functions 
on [a, b] for which I(x) < u(z) for all x € (a, B). 


Given [a, b], 1, and wu as in the above, let S be the set of all pairs (x, y) € R?, for 
whicha < x < bandl(x) < y < u(x). Then S is called an open geometric set. 
If we replace the < signs with < signs, i.e., if S is the set of all (a, y) such that 
a<a«<b,andl(x) < y < u(z), then S is called a closed geometric set. In 
either case, we say that S is bounded on the left and right by the vertical line 
segments {(a, y) : l(a) < y < u(a)} and {(b, y) : l(b) < y < u(b)}, and it is 
bounded below by the graph of the function / and bounded above by the graph of 
the function u. We call the union of these four bounding curves the boundary of 
S, and denote it by Cg. 


If the bounding functions u and J of a geometric set S are smooth or piecewise 
smooth functions, we will call S' a smooth or piecewise smooth geometric set. 


If S is a closed geometric set, we will indicate the corresponding open geometric set by 
the symbol S°. 


The symbol S° we have introduced for the open geometric set corresponding to a 
closed one is the same symbol that we have used previously for the interior of a set. 
Study the exercise that follows to see that the two uses of this notation agree. 
Exercise: 


Problem: 


a. Show that rectangles, triangles, and circles are geometric sets. What in fact is 
the definition of a circle? 

b. Find some examples of sets that are not geometric sets. Think about a 
horseshoe on its side, or a heart on its side. 

c. Let f be a continuous, nonnegative function on |a, b]. Show that the “region” 
under the graph of f is a geometric set. 

d. Show that the intersection of two geometric sets is a geometric set. Describe 
the left, right, upper, and lower boundaries of the intersection. Prove that the 
interior (SM S2)° of the intersection of two geometric sets S, and S> 
coincides with the intersection S$? MS? of their two interiors. 


e. Give an example to show that the union of two geometric sets need not be a 
geometric set. 

f. Show that every closed geometric set is compact. 

g. Let S be a closed geometric set. Show that the corresponding open geometric 
set S° coincides with the interior of S, i.e., the set of all points in the interior 
of S. HINT: Suppose a < x < bandI(x) < y < u(z). Begin by showing 
that, because both J and w are continuous, there must exist ane > O anda 
6 >Osuchthata<x—d<2+6< bandl(xz) << y-—Ee<yte< u(z). 


Now, given a geometric set S (either open or closed), that is determined by an interval 
[a, b] and two bounding functions u and J, let P = {xp < 41 <...< an}bea 
partition of [a, b]. For each 1 < i < n, define numbers c; and d; as follows: 
Equation: 


cj; = sup I(x), andd;= inf u(a). 
Lji-1<2< 2; Tj-1<L< Tj 


Because the functions / and wu are continuous, they are necessarily bounded, so that the 
supremum and infimum above are real numbers. For each 1 < 2 < n define R; to be 
the open rectangle (2;-1, x;) x (c;, d;). Of course, d; may be < c;, in which case the 
rectangle R; is the empty set. In any event, we see that the partition P determines a 
finite set of (possibly empty) rectangles {R;}, and we denote the union of these 
rectangles by the symbol @p. = U"_, (2-1, 2;) X (c, di). 


The area of the rectangle R; is (x; — 4-1) (d; — c;) if c; < d; and 0 otherwise. We 
may write in general that A (R;) = (x; — 2;_1) max ((d; — c;), 0). Define the 
number A p by 

Equation: 


Ap = > (x; —_ pi=1) (d; —_ Cj). 


Note that Ap is not exactly the sum of the areas of the rectangles determined by P 
because it may happen that d; < c; for some 7's, so that those terms in the sum would 
be negative. In any case, it is clear that Ap is less than or equal to the sum of the areas 
of the rectangles, and this notation simplifies matters later. 


For any partition P, we have S > @p, so that, if A(S) is to denote the area of S, we 
want to have 
Equation: 


A(S) 


IV 


» A(R;) 


= S° (x; — £;_,) max ((d; — ¢;),0) 


=A 
Ps ye (x; — #1) (d; — ¢;) 
=I 


= Ap. 


Let S be a geometric set (either open or closed), bounded on the left by x = a, on 
the right by z = b, below by the graph of J, and above by the graph of wu. Define 
the area A(S) of S by 


Equation: 


A(S) = Ap= ¢— 3-1) (di — G), 
(5) sup Ap sup S> (ai — @i-1) (di — 1) 


P={29<21<...<2n} i=l 


where the supremum is taken over all partitions P of |a, 6], and where the 
numbers c; and d; are as defined above. 


Exercise: 


Problem: 


a. Using the notation of the preceding paragraphs, show that each rectangle R; 
is a subset of the set S and that R; 1 R; = 0 ifi ¥ j. It may help to draw a 
picture of the set S and the rectangles {R;}. Can you draw one so that 
d; < cj? 

b. Suppose 5S} is a geometric set and that S» is another geometric set that is 
contained in S;. Prove that A (S_) < A (.S;). HINT: For each partition P, 
compare the two A p's. 


Exercise: 
Problem: 
Let T be the triangle in the plane with vertices at the three points (0, 0), (0, H), 


and (B, 0). Show that the area A(T), as defined above, agrees with the formula 
A = (1/2)BH, where B is the base and 4 is the height. 


The next theorem gives the connection between area (geometry) and integration 
(analysis). In fact, this theorem is what most calculus students think integration is all 
about. 


Let S be a geometric set, i.e., a subset of R? that is determined in the above manner by 
a closed bounded interval [a, b] and two bounding functions J and wu. Then 
Equation: 


b 
A(S) = (u(x) —I(a)) da. 


Let P = {xp < 41 < ... < @,} bea partition of |a, b], and let c; and d; be defined as 
above. Let h be a step function that equals d; on the open interval (x;-1, x;), and let k 
be a step function that equals c; on the open interval (x;_1, 2;). Then on each open 
interval (x;-1, 2;) we have h(x) < u(x) and k(x) > I(x). Complete the definitions of 
h and k by defining them at the partition points so that h (x;) = k(a;) for all 7. Then 
we have that h(x) — k(x) < u(x) — I(x) for all x € |a, 6]. Hence, 

Equation: 


n b b 
Ap => (0-214) (d-e)= | (hn) < f (u —1). 


i=1 


Since this is true for every partition P of [a, 6], it follows by taking the supremum over 
all partitions P that 
Equation: 


b 
A(S) =sup Ap < / (u (2) —1(2)) de, 


a 


which proves half of the theorem; i.e., that A(S) < i u—l. 


To see the other inequality, let h be any step function on |a, b] for which h(x) < u(x) 
for all x, and let k be any step function for which k(x) > l(a) for all x. Let 

P= {xo < 21 <... < &n} bea partition of |a, b] for which both h and k are constant 
on the open subintervals (x;_1, 2;) of P. Let a1, a2, ..., @n and by, bo, ..., b, be the 
numbers such that h (2) = a; on (#;-1, x;) and k (x) = 6; on (x;_1, 2;). It follows, 
since h(x) < u(x) for all x, that a; < d;. Also, it follows that b; > c;. Therefore, 
Equation: 


/ (h —k) = > (a; — b;) (wi — 2-1) < 2 (a; — v;-1) (d; — c;) = Ap < A(S). 


Finally, let {h,,, } be a nondecreasing sequence of step functions that converges 
uniformly to wu, and let {k,,,} be a nonincreasing sequence of step functions that 
converges uniformly to J. See part (d) of [link]. Then 

Equation: 


[wo =tip (im — bn) < ACS), 


which proves the other half of the theorem. 
OK! Trumpet fanfares, please! 
(A = rr?.) If S is a circle in the plane having radius 7, then the area A(S) of S is mr’. 


Suppose the center of the circle S is the point (h, k). This circle is a geometric set. In 
fact, we may describe the circle with center (h, k) and radius r as the subset 9 of R? 
determined by the closed bounded interval [h — r, h + r] and the functions 
Equation: 


u(z)=k+ /r? — (« —h)? 


and 
Equation: 


(2) =k fr? — (@ —hy?. 


By the preceding theorem, we then have that 


Equation: 
h+r 
A(s)= | 24/172 — (x — h)* dx = mr’. 
h-r 


We leave the verification of the last equality to the following exercise. 
Exercise: 


Problem: Evaluate the integral in the above proof: 


Equation: 
h+r 
i 24/172 — (a — h)? dx. 


Be careful to explain each step by referring to theorems and exercises in this book. 
It may seem like an elementary calculus exercise, but we are justifying each step 
here. 


REMARK There is another formula for the area of a geometric set that is sometimes 
very useful. This formula gives the area in terms of a “double integral.” There is really 
nothing new to this formula; it simply makes use of the fact that the number (length) 
u(x) — I(a) can be represented as the integral from J(x) to u(x) of the constant 1. 


Here's the formula: 
b u(x) 
A(S) = : / 1dy | dz. 
a I(x) 


Equation: 

The next theorem is a result that justifies our definition of area by verifying that the 
whole is equal to the sum of its parts, something that any good definition of area should 
satisfy. 


Let S' be a closed geometric set, and suppose S = U?_,S;, where the sets {.5;} are 
closed geometric sets for which $2 S° = Qifi #7. Then 
Equation: 


A(S) = > AS). 


Suppose S$ is determined by the interval [a, b] and the two bounding functions / and u, 
and suppose $; is determined by the interval [a,;, b;| and the two bounding functions J; 
and u;. Because $; C S, it must be that the interval [a;, b;] is contained in the interval 
[a, b]. Initially, the bounding functions 1; and u; are defined and continuous on [a;, b;], 
and we extend their domain to all of |a, 6] by defining J; (x) = u; (x) = 0 for all 

x € |a, 5] that are not in [a;, b;]. The extended functions J; and u; may not be 


continuous on all of |a, 6], but they are still integrable on [a, b|. (Why?) Notice that we 
now have the formula 
Equation: 


bi b 
A(S;) = - (u; (x) — 1; (x)) dx = / (u; (x) — 1; (x)) dx. 


Next, fix an a in the open interval (a, b). We must have that the vertical intervals 

(0; (x), w; (w)) and (1; (a), uw; (x)) are disjoint if 1 A 7. Otherwise, there would exist a 
point y in both intervals, and this would mean that the point (x, y) would belong to 
both $? and S$ i. which is impossible by hypothesis. Therefore, for each x € (a, b), the 
intervals { (1; ()x), u; (z))} are pairwise disjoint open intervals, and they are all 
contained in the interval (I(a), u(a)), because the S;'s are subsets of S. Hence, the 
sum of the lengths of the open intervals {(U; (a), u; (#))} is less than or equal to the 
length of (U(), u(x)). Also, for any point y in the closed interval {J(x), u(a)], the 
point (a, y) must belong to one of the S;'s, implying that y is in the closed interval 

(J; (x), wu; (a)] for some 2. But this means that the sum of the lengths of the closed 
intervals [I; (x), u; (x)] is greater than or equal to the length of the interval 

[J(a), w(x)]. Since open intervals and closed intervals have the same length, we then 


see that (u(x) — I(x) = S07, (ui (x) — 1; (a). 


We now have the following calculation: 
Equation: 


n_ pb 

> , (us (2) 1 (a) ) de 
bn 

/ Ss (u; (x) — 1; (a)) dx 


, 


| 


I 


I 


b 
i. (u(x) —I(x)) dx 
A(S), 


| 


which completes the proof. 


Extending the Definition of Integrability 

We now wish to extend the definition of the integral to a wider class of 
functions, namely to some that are unbounded and Others whose domains 
are not closed and bounded intervals. This extended definition is somewhat 
ad hoc, and these integrals are sometimes called “improper integrals.” 


We now wish to extend the definition of the integral to a wider class of 
functions, namely to some that are unbounded and Others whose domains 
are not closed and bounded intervals. This extended definition is somewhat 
ad hoc, and these integrals are sometimes called “improper integrals.” 


Let f be a real or complex-valued function on the open interval (a, b) 
where a is possibly —oo and b is possibly +00. We say that f is 
improperly-integrable on (a, b) if it is integrable on each closed and 
bounded subinterval [a’, 6’) C (a,b), and for each point c € (a,b) we 


have that the two limits lim b’ > b — 0 fe fandlimg aio fof 
exist. 


More generally, We say that a real or complex-valued function f, not 
necessarily defined on all of the open interval (a, b), is improperly- 
integrable on (a, b) if there exists a partition {2;} of [a, 6] such that f 
is defined and improperly-integrable on each open interval (2-1, 2;). 


We denote the set of all functions f that are improperly-integrable on 
an open interval (a, b) by I; ((a, b)). 


Analogous definitions are made for a function's being integrable on half- 
open intervals |a, b) and (a, b). 


Note that, in order for f to be improperly-integrable on an open interval, we 
only require f to be defined at almost all the points of the interval, i.e., at 
every point except the endpoints of some partition. 

Exercise: 


Problem: 


a. Let f be defined and improperly-integrable on the open interval 
(a, b). Show that limy ,ai0 fo, f+ limy_,4-0 he f is the same 
for all c € (a, b). 

b. Define a function f on (0,1) by f (x) = (1 — z) 1/”. Show that 
f is improperly-integrable on (0, 1) and that f is not bounded. 
(Compare this with part (1) of [link].) 

c. Define a function g on (0, 1) by g(x) = (1 — x) '. Show that g 
is not improperly-integrable on (0, 1), and, using part (b), 
conclude that the product of improperly-integrable functions on 
(0, 1) need not itself be improperly-integrable. (Compare this 
with part (3) of [Llink].) 

d. Define h to be the function on (0, 00) given by h(x) = 1 for all 
x. Show that A is not improperly-integrable on (0, 00). (Compare 
this with parts (4) and (5) of [link].) 


Part (a) of the preceding exercise is just the consistency condition we need 
in order to make a definition of the integral of an improperly-integrable 
function over an open interval. 


Let f be defined and improperly-integrable on an open interval (a, b). 
We define the integral of f over the interval (a, b), and denote it by 


J? f, by 


Equation: 


b c ; 
/ a eet [ I+ vim, | I. 


In general, if f is improperly-integrable over an open interval, i.e., f is 
defined and improperly-integrable over each subinterval of (a, b) 
determined by a partition {x;}, then we define the integral of f over 
the interval (a, b) by 

Equation: 


b n Li 
Lids 


Let (a, b) be a fixed open interval (with a possibly equal to —co and b 
possibly equal to +oo), and let I; ((a, 6)) denote the set of improperly- 
integrable functions on (a, b). Then: 


1. I; ((a, b)) is a vector space of functions. 

2. (Linearity) - (af + Bg) = af’ f+ BS? g for all f, g € I; ((a, )) 
anda, BEC. 

3. (Positivity) If f(x) > 0 for all x € (a,b), then { 72%, 

4. (Order-preserving) If f, g € I; ((a,b)) and f(x) < g(x) for all 


x € (a,b), then ei < fg. 


Exercise: 


Problem: 


a. Use [link], [link], [link], and properties of limits to prove the 
preceding theorem. 

b. Let f be defined and improperly-integrable on (a, b). Show that, 
given an € > O, there exists a 6 > 0 such that for any 
a<a’<a+d6andanyb—6< 0)! < bwehave 


fo ft Sp dl <e 


c. Let f be improperly-integrable on an open interval (a, b). Show 
that, given an € > 0, there exists a d > 0 such that if (c, d) is any 


open subinterval of (a, b) for which d — c < 6, then | ‘a fl<e. 


HINT: Let {x; } be a partition of [a, b] such that f is defined and 

improperly-integrable on each subinterval (x;_1, x; ). For each 3, 

choose a 6; using part (b). Now f is bounded by M on all the 

intervals [x;_1 + 6;, x; — 6;], 80 6 = €/M should work there. 
d. Suppose f is a continuous function on a closed bounded interval 

[a, b] and is continuously differentiable on the open interval 

(a, b). Prove that f’ is improperly-integrable on (a, b), and 


evaluate ie f'. HINT: Fix a point c € (a,b), and use the 
Fundamental Theorem of Calculus to show that the two limits 
exist. 

e. (Integration by substitution again.) Let g : [c, d] — [a, b| be 
continuous on [c, d] and satisfy g(c) = a and g(d) = b. Suppose 
there exists a partition {ro < x1 <... < £,} of the interval [c, d] 
such that g is continuously differentiable on each subinterval 
(x;_1, x;). Prove that g’ is improperly-integrable on the open 
interval (c, d). Show also that if f is continuous on [a, b], we 
have that 
Equation: 


b d 
/ f (t)dt = / f(9(s))g (8) ds. 


HINT: Integrate over the subintervals (2,1, 2;), and use part (d). 


REMARK Note that there are parts of [link] and [link] that are not asserted 
in [link]. The point is that these other properties do not hold for improperly- 
integrable functions on open intervals. See the following exercise. 
Exercise: 


Problem: 


a. Define f to be the function on [1, 00) given by 
f(x) =(-1)"' /nifn —1 <a <n. Show that f is 
improperly-integrable on (1, 00), but that | f| is not improperly- 
integrable on (1, oo). (Compare this with part (4) of [link].) 
HINT: Verify that [ i. f is a partial sum of a convergent infinite 


series, and then verify that f- ‘a |f| is a partial sum of a divergent 
infinite series. 

b. Define the function f on (1, co) by f(x) = 1/z. For each 
positive integer n, define the function f, on (1, 00) by 
fn (x) =1/aif 1 < x < mand f, (x) = 0 otherwise. Show that 
each f,, is improperly-integrable on (1, co), that f is the uniform 


limit of the sequence {f,,}, but that f is not improperly- 
integrable on (1, oo). (Compare this with part (5) of Theorem 
5.6.) 

c. Suppose f is a nonnegative real-valued function on the half-open 
interval (a, co) that is integrable on every closed bounded 
subinterval [a, b'). For each positive integer n > a, define 
i= ie f (x) dx. Prove that f is improperly-integrable on 
[a, co) if and only if the sequence {y,,} is convergent. In that 
case, show that f™ f =lim y,,. 


We are now able to prove an important result relating integrals over infinite 
intervals and convergence of infinite series. 


Let f be a positive function on {1, oo), assume that f is integrable on every 
closed bounded interval |1, 6], and suppose that f is nonincreasing; i.e., if 


x < ythen f(x) > f(y). For each positive integer i, set a; = f (7), and let 


Sy denote the Nth partial sum of the infinite series S* a; :Sy = ae Qj. 


Then: 


1. For each NV, we have 
Equation: 


N 
Sy-a< [ J (@) de 5 yes: 
1 


2. For each N, we have that 
Equation: 


N 
Sym f f (x) dz <a, —ayn < a}; 
1 


i.e., the sequence {8 N-1— { :. f \ is bounded above. 


3. The sequence {s v-i— ff . 7 \ is nondecreasing. 


4. (Integral Test) The infinite series 5° a; converges if and only if the 
function f is improperly-integrable on (1, 00). 


For each positive integer VV, define a step function ky on the interval 

[1, N] as follows. Let P = {ay < 21 <... < 2y_1} be the partition of 
[1, N] given by the points {1 <2<3<...< Nh,ie, x2; =i+1. Define 
ky (a) to be the constant c; = f (i + 1) on the interval 

[z;-1, ;) = [¢,i + 1). Complete the definition of ky by setting 

ky (N) = f (NV). Then, because f is nonincreasing, we have that 

ky (x) < f (x) forall  € [1, NJ. Also, 

Equation: 


N N-1 
/ kn = 3 CAR eA) 
1 * 


which then implies that 
Equation: 


N N 
Sy-a= [ ky (0) dx < | f (xyz. 
1 1 


This proves half of part (1). 


For each positive integer VV > 1 define another step function / jy, using the 
same partition P as above, by setting ly (x) = f (i) ifi <2 <i+1 for 
1 <i < N, and complete the definition of ly by setting ly (N) = f (N). 
Again, because f is nonincreasing, we have that f (x) < ly (a) for all 

x € {1, NJ. Also 

Equation: 


[ow - yr 


which then implies that 
Equation: 


N N 
/ f(a)de < | ly (x) dx = Syn-4, 
1 1 
and this proves the other half of part (1). 


It follows from part (1) that 
Equation: 


N 
Sva- | f (a) dx < Sy_-1—Sn +a) = a1 — ay, 
1 


and this proves part (2). 


We see that the sequence 1s Net f - A \ is nondecreasing by observing 


that 
Equation: 


IV 
= 


because f is nonincreasing. 


Finally, to prove part (4), note that both of the sequences {Sy} and 

{ J id f ¢ are nondecreasing. If f is improperly-integrable on [1, co), then 
lim jv jf . f exists, and Sy <a, + f io f (x) dz for all N, which implies 
that 5° a; converges by [link]. Conversely, if 5) a; converges, then lim Sy 
exists. Since ‘al f (x) dx < Sy_1, it then follows, again from [link], that 
lim f . f (x) dx exists. So, by the preceding exercise, f is improperly- 
integrable on [1, co). 


We may now resolve a question first raised in [link]. That is, for 1 < s < 2, 
is the infinite series 5) 1/n* convergent or divergent? We saw in that 
exercise that this series is convergent if s is a rational number. 

Exercise: 


Problem: 


a. Let s be a real number. Use the Integral Test to prove that the 
infinite series 5° 1/n* is convergent if and only if s > 1. 

b. Let s be a complex number s = a + 02. Prove that the infinite 
series ) 1/n® is absolutely convergent if and only if a > 1. 


Exercise: 


Problem: Let f be the function on [1, oo) defined by f(x) = 1/z. 


. Use [link] to prove that the sequence pee +_InN \ 


i=1 i 

converges to a positive number y < 1. (This number 7¥ is called 
Euler's constant.) HINT: Show that this sequence is bounded 
above and nondecreasing. 

. Prove that 

Equation: 


HINT: Write S27 for the 2th partial sum of the series. Use the 
fact that 
Equation: 


Now add and subtract In (2/V) and use part (a). 


Integration in the Plane 
A selection of theorems about integration in the plane. Theorems from other modules 
from the same author are referenced, and practice exercises are included. 


Let S be a closed geometric set in the plane. If f is a real-valued function on S, we would 
like to define what it means for f to be “integrable” and then what the “integral” of f is. 
To do this, we will simply mimic our development for integration of functions on a closed 
interval [a, 6]. 


So, what should be a “step function” in this context? That is, what should is a “partition” 
of S be in this context? Presumably a step function is going to be a function that is 
constant on the “elements” of a partition. Our idea is to replace the subintervals 
determined by a partition of the interval [a, b] by geometric subsets of the geometric set S. 


The overlap of two geometric sets S; and S is defined to be the interior (9) S2)° 
of their intersection. S, and S$» are called nonoverlapping if this overlap (SM S>)° 
is the empty set. 


A partition of a closed geometric set 9 in R? is a finite collection {5}, S2,..., Sn} of 
nonoverlapping closed geometric sets for which U?_, S; = S; i.e., the union of the S; 
's is all of the geometric set S. 


The open subsets {sy } are called the elements of the partition. 


A step function on the closed geometric set S is a real-valued function h on S for 
which there exists a partition P = {S;} of S such that h (z) = a; for all z € S?; 
i.e., h is constant on each element of the partition P. 


REMARK One example of a partition of a geometric set, though not at all the most 
general kind, is the following. Suppose the geometric set S is determined by the interval 
(a, b] and the two bounding functions wu and I. Let {2p < 1 < ... < &n} be a partition of 
the interval [a, b]. We make a partition {S;} of S by constructing vertical lines at the 
points x; from I(x;) to u(#;). Then S; is the geometric set determined by the interval 
[z;-1, @;| and the two bounding functions wu; and 1; that are the restrictions of u and J to 
the interval [x;-1, xi]. 


A step function is constant on the open geometric sets that form the elements of some 
partition. We say nothing about the values of h on the “boundaries” of these geometric 
sets. For a step function A on an interval |a, 6], we do not worry about the finitely many 
values of A at the endpoints of the subintervals. However, in the plane, we are ignoring 
the values on the boundaries, which are infinite sets. As a consequence, a step function on 
a geometric set may very well have an infinite range, and may not even be a bounded 
function, unlike the case for a step function on an interval. The idea is that the boundaries 


of geometric sets are “negligible” sets as far as area is concerned, so that the values of a 
function on these boundaries shouldn't affect the integral (average value) of the function. 


Before continuing our development of the integral of functions in the plane, we digress to 
present an analog of [link] to functions that are continuous on a closed geometric set. 


Let f be a continuous real-valued function whose domain is a closed geometric set S. 
Then there exists a sequence {h,,} of step functions on S that converges uniformly to f. 


As in the proof of [link], we use the fact that a continuous function on a compact set is 
uniformly continuous. 


For each positive integer n, let 6,, be a positive number satisfying | f(z) — f(w)| < 1/n 
if |z — w| < 6,. Such a6, exists by the uniform continuity of f on S. Because S is 
compact, it is bounded, and we let R = |a, b| x [c,d] be a closed rectangle that contains 
S. We construct a partition {S”} of S as follows. In a checkerboard fashion, we write R 
as the union UR? of small, closed rectangles satisfying 


1. If z and w are in R?, then |z — w| < 6,. (The rectangles are that small.) 
Z ao N Re = 0. (The interiors of these small rectangles are disjoint.) 


Now define S” = $M R?. Then $S?° NO sre = (), and S = US}. Hence, {S?} isa 
partition of S. 


For each i, choose a point z; in S$’, and set a = f (z?). We define a step function h,, as 
follows: If z belongs to one (and of course only one) of the open geometric sets oo. set 
hn (z) = a’. And, if z does not belong to any of the open geometric sets ae set 

hn (z) = f (2). It follows immediately that h,, is a step function. 


Now, we claim that | f (z)—An (z)| < 1/n for all z € S. For any z in one of the $”'s, we 
have 
Equation: 


If (2)—hn (2)| =lf2)—-aF |= fF (2) — f (27) |< 1/n 


because |z—z”|< d,,. And, for any z not in any of the S”"'s, f (z) — An (z) = 0. So, we 
have defined a sequence {h,,} of step functions on S, and the sequence {h,,} converges 
uniformly to f by [link]. 


What follows now should be expected. We will define the integral of a step function h 
over a geometric set S' by 
Equation: 


[r=Yo x A(S;). 


We will define a function f on S to be integrable if it is the uniform limit of a sequence 
{h,,} of step functions, and we will then define the integral of f by 


Equation: 
/ f =lim / vee 
Ss Ss 


Everything should work out nicely. Of course, we have to check the same two consistency 
questions we had for the definition of the integral on [a, 5], i.e., the analogs of [link] and 
[link]. 


Let S be a closed geometric set, and let h be a step function on S. Suppose 
P= {S},...,S,} and Q = {T),..., Ti} are two partitions of S for which h(z) is the 
constant a; on S? and h(z) is the constant b; on Ty Then 


Equation: 
S> a;A (S;) => S> b;A (T;). 
i=l j=l 


We know by part (d) of [link] that the intersection of two geometric sets is itself a 
geometric set. Also, for each fixed index 7, we know that the sets {T; N Sc} are pairwise 


disjoint. Then, by [link], we have that A (T;) = 53, A(T; 9 S;). Similarly, for each 
fixed 7, we have that A (5, = 1 A (T; 9 Si). Finally, for each pair ¢ and j, for which 
the set r MS? is not empty, choose a point 2;,; € T? M S?, and note that 

a; = h(zi,;) = bj, because z;,; belongs to both SP and T\?. 


With these observations, we then have that 
Equation: 


which completes the proof. 
OK, the first consistency condition is satisfied. Moving right along: 


Let h be a step function on a closed geometric set S. Define the integral of h over 
the geometric set S by the formula 
Equation: 


[r= [#@e= Laas, 


where 5S}, ..., 5, is a partition of S for which h is the constant a; on the interior S i 
of the set S;. 


Just as in the case of integration on an interval, before checking the second consistency 
result, we need to establish the following properties of the integral of step functions. 


Let H(S) denote the vector space of all step functions on the closed geometric set S. 
Then the assignment h — fh of H(S) into R has the following properties: 


1. (Linearity) H(S) is a vector space, and [, (hi + ho) = fohi+ fg ho, and 
fgch =cJf,h for all hy, h,h € H(S), and for all real numbers c. 

2. If h = Soi, cixs, is a linear combination of indicator functions of geometric sets 
that are subsets of S, then fh = )°"_, c;A (Si). 

3. (Positivity) If h(z) > 0 for all z € S, then fh > 0. 

4. (Order-preserving) If hy and hy are step functions on S for which hy (z) < hg (z) 
forall z € S,then fohi < fo ho. 


Suppose hj is constant on the elements of a partition P = {.S;} and hg is constant on the 
elements of a partition Q = {T;}. Let V be the partition of the geometric set S whose 


elements are the sets {U;,} = {39 Nn wee Then both h, and hg are constant on the 


elements U;, of V, so that hy + hg is also constant on these elements. Therefore, h; + hg 
is a step function, and 
Equation: 


[os +h) = >> (ag + bx) A (Uz) = > an A (Un) + S- bp (Uz) = [or + Jv 
k k k 


and this proves the first assertion of part (1). 


The proof of the other half of part (1), as well as parts (2), (3), and (4), are totally 
analogous to the proofs of the corresponding parts of [link], and we omit the arguments 
here. 


Now for the other necessary consistency condition: 
let S be a closed geometric set in the plane. 


1. If {h,,} is a sequence of step functions that converges uniformly to a function f on 
S, then the sequence { J s hy} is a convergent sequence of real numbers. 
2. If {h,} and {k,,} are two sequences of step functions on S that converge uniformly 


to the same function f, then 
Equation: 
tim | hy aim f i. 
S S 


Exercise: 


Problem: Prove [link]. Mimic the proofs of [link] and [link]. 


If f is a real-valued function on a closed geometric set S in the plane, then f is 
integrable on S if it is the uniform limit of a sequence {h,,} of step functions on S. 


We define the integral of an integrable function f on S by 


Equation: 
[r= [ t@a=im fn, 


where {h,,} is a sequence of step functions on S that converges uniformly to f. 


Let S be a closed geometric set in the plane, and let [(.S') denote the set of integrable 
functions on S. Then: 


1. I(S) is a vector space of functions. 

2. If f and g € I(S), and one of them is bounded, then fg € I(S). 

3. Every step function is in I(S). 

4. If f is a continuous real-valued function on S, then f is in J(S). That is, every 
continuous real-valued function on S is integrable on S. 


Exercise: 


Problem: 


a. Prove [link]. Note that this theorem is the analog of [link], but that some things 
are missing. 

b. Show that integrable functions on S are not necessarily bounded; not even step 
functions have to be bounded. 

c. Show that, if f € I(S), and g is a function on S for which f(z, y) = 9(z, y) 
for all (a, y) in the interior S° of S, then g € I(S). That is, integrable functions 
on S can do whatever they like on the boundary. 


Let S be a closed geometric set. The assignment f — [ f on I(S) satisfies the following 
properties. 


1. (Linearity) I(.S) is a vector space, and [, (af + 6g) =afof+ A J,g forall 
f,g €I(S)anda, Be R. 
2. (Positivity) If f(z) > 0 for all z € S, then f, f > 0. 
3. (Order-preserving) If f,g € I(S) and f(z) < g(z) forall z € S, then fo f < fog. 
4. If f € I(S), then so is |f|, and | fy f| < fo |fl- 
5. If f is the uniform limit of functions f,,, each of which is in I(S), then f € I(S) and 


Is f =lim eee 


6. Let {un } be a sequence of functions in I(S), and suppose that for each n there is a 
number m,,, for which |u,, (z)| < mrp for all z € S, and such that the infinite series 
> m, converges. Then the infinite series 5) u,, converges uniformly to an 
integrable function, and fy )> Un = >> fg Un: 

7. If f € I(S), and {5}, ...,S,} is a partition of S, then f € I(S;) for all 7, and 


Equation: 
n 
kde 
Exercise: 
Problem: 


Prove [link]. It is mostly the analog to [link]. To see the last part, let h; be the step 
function that is identically 1 on S;; check that h; f € I (S;); then examine 


> Js fri 


Of course, we could now extend the notion of integrability over a geometric set S' to 
include complex-valued functions just as we did for integrability over an interval {a, 5]. 
However, real-valued functions on geometric sets will suffice for the purposes of this 
book. 


We include here, to be used later in [link], a somewhat technical theorem about 
constructing partitions of a geometric set. 


Let 5S}, ..., S, be closed, nonoverlapping, geometric sets, all contained in a geometric set 


S. Then there exists a partition a iS. m Of S such that for 1 < i < n wehave S; = S. 
In other words, the s;'s are the first n elements of a partition of S. 


Suppose S is determined by the interval [a, b] and the two bounding functions u and J. We 
prove this theorem by induction on n. 


If n = 1, let S; be determined by the interval [a, b;| and the two bounding functions u4 
and l;. Set S = 1, and define four more geometric sets So, oer Ss as follows: 


1. 5 is determined by the interval [a, a;| and the two bounding functions u and / 
restricted to that interval. 

2. S3 is determined by the interval [a , b;| and the two bounding functions u and wu; 
restricted to that interval. 

3. S4 is determined by the interval |a;, b,| and the two bounding functions J and 1, 
restricted to that interval. 


4. $s is determined by the interval [b;, 6] and the two bounding functions wu and J 
restricted to that interval. 


Observe that the five sets $1, So, ...,.S5 constitute a partition of the geometric set S, 
proving the theorem in the case n = 1. 


Suppose next that the theorem is true for any collection of n sets satisfying the 
hypotheses. Then, given Sj, ...,. 5,41 as in the hypothesis of the theorem, apply the 
inductive hypothesis to the n sets Sj, ...,.S, to obtain a partition 7}, ..., J, of S for 
which 7; = S; for all 1 <7 <n. For eachn + 1 <i < m, consider the geometric set 
S! = Spii1T; of the geometric set T;. We may apply the case n = 1 of this theorem to 
this geometric set to conclude that S; is the first element Si of a partition 


{$1 4, S)0, +++) Sim, } of the geometric set Tj. 


Define a partition {Su} of S as follows: For 1 < k < n, set cS = Ty. Set 


Sri = Um 41931 = Sn41- And define the rest of the partition {Si} to be made up of 
the remaining sets Si forn+1<2i< mand2 <j < ™,. It follows directly that this 
partition {Su} satisfies the requirements of the theorem. 

Exercise: 


Problem: 


Let $1, ..., Sp, be as in the preceding theorem. Suppose S; is determined by the 
interval [a;, b,| and the two bounding functions ux and J;,. We will say that Sj, is 
“below” S;, equivalently S;; is “above” S,, if there exists a point x such that 

uz (x) < 1, (x). Note that this implies that x € [ax, bg] M [a;, 55]. 


a. Suppose Sz is below S, and suppose (z, yz) € S, and (z, y;) € Sj. Show that 
Yj > Yk. That is, if S; is below S;, then no part of S;, can be above S;. 

b. Suppose S¥ is below S; and $3 is below S2. Show that no part of S3 can be 
above S;. HINT: By way of contradiction, let 2; € [a1, bi] be such that 
U2 (air) <l (ei) let rg € [a2, ba| be such that ws (x2) <I» (x2); and 
suppose x3 € [a3, b3] is such that wz (#3) < 3 (x3). Derive contradictions for 
all possible arrangements of the three points x), 9, and z3. 

c. Prove that there exists an index kp such that S;,, is minimal in the sense that 
there is no other S; that is below S;,. HINT: Argue by induction on n. Thus, let 
{T;} be the collection of all S;'s that are below $j, and note that there are at 
most n — 1 elements of {T;}. By induction, there is one of the Tj's, i.e., an Sho 
that is minimal for that collection. Now, using part (b), show that this S;,, must 
be minimal for the original collection. 


There is one more concept about integrating over geometric sets that we will need in later 
chapters. We have only considered sets that are bounded on the left and right by straight 
vertical lines and along the top and bottom by graphs of continuous functions y = u(z) 
and y = I(x). We equally well could have discussed sets that are bounded above and 
below by straight horizontal lines and bounded on the left and right by graphs of 
continuous functions x = I(y) and x = r(y). These additional sets do not provide 
anything particularly important, so we do not discuss them. However, there are times 
when it is helpful to work with geometric sets with the roles of horizontal and vertical 
reversed. We accomplish this with the following definition. 


Let S be a subset of R?. By the symmetric image of S we mean the set S of all 
points (x,y) € R? for which the point (y, z) € S. 


The symmetric image of a set is just the reflection of the set across the y = z line in the 
plane. Note that the symmetric image of the rectangle |a, b| x {c, d] is again a rectangle, 
[c,d] x [a, b], and therefore the area of a rectangle is equal to the area of its symmetric 
image. This has the implication that if the symmetric image of a geometric set is also a 
geometric set, then they both have the same area. The symmetric image of a geometric set 
doesn't have to be a geometric set itself. For instance, consider the examples suggested in 
part (b) of [link]. But clearly rectangles, triangles, and circles have this property, for their 
symmetric images are again rectangles, triangles, and circles. For a geometric set, whose 
symmetric image is again a geometric set, there are some additional computational 
properties of the area of S as well as the integral of functions over S, and we present them 
in the following exercises. 

Exercise: 


Problem: 


Suppose S is a closed geometric set, which is determined by a closed interval [a, 
and two bounding functions u(x) and J(a). Suppose the symmetric image S of S is 


also a closed geometric set, determined by an interval a, 7 and two bounding 


functions @ (a) and i (a). 


a. Make up an example to show that the numbers @ and 6 need not have anything 
to do with the numbers a and 8, and that the functions @ and / need not have 
anything to do with the functions u and J. 


b. Prove that $' and § have the same area. HINT: use the fact that the area of a 
geometric set is approximately equal to the sum of the areas of certain 
rectangles, and then use the fact that the area of the symmetric image of a 
rectangle is the same as the area of the rectangle. 


c. Show that for every point (x, y) € S, we must have @ < y < b, and for every 
such y, we must have i(y) <a < u(y). HINT: If (z,y) € S, then (y,xz) € S. 


d. (d) Prove that the area A(.S) of S is given by the formula 


Equation: 
b pu(x) 6 pay) 
A(s)= | / Ldyde = [ 7 1 dxdy. 
a Ji(z) a JI(y) 
(See [link].) 


e. Let S be the right triangle having vertices (a, c), (b,c), and (b, d), where d > c. 


Describe the symmetric image of S; i.e., find the corresponding G, b, u, and i. 
Use part (d) to obtain the following formulas for the area of S : 
Equation: 


b pd++% (cd) d 
A(S) = 1 dsdt = 1 dsdt. 
a Jc c db £4 (a—b) 


Exercise: 


Problem: 


a. Prove that if S; and S5 are geometric sets whose symmetric images are again 
geometric sets, then the symmetric image of the geometric set S; 1 Sy is also a 
geometric set. 

b. Suppose 7’ is a closed geometric set that is contained in a closed geometric set 
S. Assume that both the symmetric images T and S are also geometric sets. If S 
is determined by an interval fa, b| and two bounding functions @ and i, prove 


that 
Equation: 


6 pu(s) 
a(r)= | I. xr (t, s) dtds, 


where yr is the indicator function of the set T; i.e., x7 (t, s) = 1if (t, s) € T, 
and yr (t, s) = Oif (t,s) ¢ T. HINT: See the proof of [link], give names to all 
the intervals and bounding functions, and in the end use part (d) of the 
preceding exercise. 

. Suppose {.S;} is a partition of a geometric set S, and suppose the symmetric 
images of S and all the S;'s are also geometric sets. Suppose h is a step function 
that is the constant a; on the element S? of the partition {5}. Prove that 
Sigh = Sj a Sg X09, and therefore that 


Equation: 


ie 


b pu(t) b pals) 
/ i= / / h(t, s) dsdt = / / h (t, s) dtds. 
8 a Ji(t) a Ji(s) 


HINT: Use part (b). 

d. Let S' be a geometric set whose symmetric image S is also a geometric set, and 
suppose f is a continuous function on S.. Show that 
Equation: 


b pu(t) 6 pias) 
/ f= / / f (t, 8) dsdt = i 7 f (t, s) dtds. 
Ss a Ji(t) a Ji(s) 


HINT: Make use of the fact that the step functions constructed in [link] satisfy 
the assumptions of part (c). Then take limits. 

e. Let S be the triangle in part (e) of the preceding exercise. If f is a continuous 
function on S, show that the integral of f over S is given by the formulas 
Equation: 


b pd++*(c—d) d 
/ j= / / f (t, s) dsdt = / / f (t, s) dtds. 
S a vc c b+ <4 (a—b) 


Integration Over Smooth Curves in the Plane C=2n r 

A brief introduction of the upcoming chapter by Lawrence Baggett. In this 
chapter we will define what we mean by a smooth curve in the plane and 
what is meant by its arc length. These definitions are a good bit more tricky 
than one might imagine. Indeed, it is the subtlety of the definition of arc 
length that prevented us from defining the trigonometric functions in terms 
of wrapping the real line around the circle, a definition frequently used in 
high school trigonometry courses. Having made a proper definition of arc 
length, we will then be able to establish the formula C' = 27r for the 
circumference of a circle of radius r. 


In this chapter we will define what we mean by a smooth curve in the plane 
and what is meant by its arc length. These definitions are a good bit more 
tricky than one might imagine. Indeed, it is the subtlety of the definition of 
arc length that prevented us from defining the trigonometric functions in 
terms of wrapping the real line around the circle, a definition frequently 
used in high school trigonometry courses. Having made a proper definition 
of arc length, we will then be able to establish the formula C' = 27 for the 
circumference of a circle of radius r. 


By the “plane,” we will mean R? = C, and we will on occasion want to 
carefully distinguish between these two notions of the plane, i.e., two real 
variables x and y as opposed to one complex variable z = x + 2y. In 
various instances, for clarity, we will use notations like x + zy and (2, y), 
remembering that both of these represent the same point in the plane. As 

x + iy, it is a single complex number, while as (x, y) we may think of it as 
a vector in R? having a magnitude and, if nonzero, a direction. 


We also will define in this chapter three different kinds of integrals over 
such curves. The first kind, called “integration with respect to arc length,” 
will be completely analogous to the integral defined in [link] for functions 
on a closed and bounded interval, and it will only deal with functions whose 
domain is the set consisting of the points on the curve. The second kind of 
integral, called a “contour integral,” is similar to the first one, but it 
emphasizes in a critical way that we are integrating a complex-valued 
function over a curve in the complex plane C’ and not simply over a subset 
of R?. The applications of contour integrals is usually to functions whose 


domains are open subsets of the plane that contain the curve as a proper 
subset, i.e., whose domains are larger than just the curve. The third kind of 
integral over a curve, called a “line integral,” is conceptually very different 
from the first two. In fact, we won't be integrating functions at all but rather 
a new notion that we call “differential forms.” This is actually the 
beginnings of the subject called differential geometry, whose intricacies and 
power are much more evident in higher dimensions than 2. 


The main points of this chapter include: 


1. The definition of a smooth curve, and the definition of its arc length, 

2. the derivation of the formula C' = 27r for the circumference of a 
circle of radius r ([link]), 

. the definition of the integral with respect to arc length, 

. the definition of a contour integral, 

. the definition of a line integral, and 

. Green's Theorem ((link]). 


A uBR WwW 


Smooth Curves in the Plane 

Our first project is to make a satisfactory definition of a smooth curve in the plane, for there is a 
good bit of subtlety to such a definition. In fact, the material in this chapter is all surprisingly 
tricky, and the proofs are good solid analytical arguments, with lots of e's and references to 
earlier theorems. 


Our first project is to make a satisfactory definition of a smooth curve in the plane, for there is a 
good bit of subtlety to such a definition. In fact, the material in this chapter is all surprisingly 
tricky, and the proofs are good solid analytical arguments, with lots of e's and references to 
earlier theorems. 


Whatever definition we adopt for a curve, we certainly want straight lines, circles, and other 
natural geometric objects to be covered by our definition. Our intuition is that a curve in the 
plane should be a “1-dimensional” subset, whatever that may mean. At this point, we have no 
definition of the dimension of a general set, so this is probably not the way to think about 
curves. On the other hand, from the point of view of a physicist, we might well define a curve as 
the trajectory followed by a particle moving in the plane, whatever that may be. As it happens, 
we do have some notion of how to describe mathematically the trajectory of a moving particle. 
We suppose that a particle moving in the plane proceeds in a continuous manner relative to 
time. That is, the position of the particle at time ¢ is given by a continuous function 

f(t) = x(t) + ty(t) = (a(t), y(t)), as ¢ ranges from time a to time b. A good first guess at a 
definition of a curve joining two points z; and z2 might well be that it is the range C’ of a 
continuous function f that is defined on some closed bounded interval [a, b]. This would be a 
curve that joins the two points z; = f (a) and z2 = f (0b) in the plane. Unfortunately, this is 
also not a satisfactory definition of a curve, because of the following surprising and bizarre 
mathematical example, first discovered by Guiseppe Peano in 1890. 


THE PEANO CURVE The so-called “Peano curve” is a continuous function f defined on the 
interval (0, 1], whose range is the entire unit square [0, 1] x [0, 1] in R?. 


Be careful to realize that we're talking about the “range” of f and not its graph. The graph of a 
real-valued function could never be the entire square. This Peano function is a complex-valued 
function of a real variable. Anyway, whatever definition we settle on for a curve, we do not 
want the entire unit square to be a curve, so this first attempt at a definition is obviously not 
going to work. 


Let's go back to the particle tracing out a trajectory. The physicist would probably agree that the 
particle should have a continuously varying velocity at all times, or at nearly all times, i.e., the 
function f should be continuously differentiable. Recall that the velocity of the particle is 
defined to be the rate of change of the position of the particle, and that's just the derivative f’ of 
f. We might also assume that the particle is never at rest as it traces out the curve, i.e., the 
derivative f’ (t) is never 0. As a final simplification, we could suppose that the curve never 
crosses itself, i.e., the particle is never at the same position more than once during the time 
interval from t = a to t = b. In fact, these considerations inspire the formal definition of a 
curve that we will adopt below. 


Recall that a function f that is continuous on a closed interval [a, 6] and continuously 
differentiable on the open interval (a, b) is called a smooth function on [a, 6]. And, if there 
exists a partition {to < t; < ... < tn} of [a, b] such that f is smooth on each subinterval 
[t;_1, t;], then f is called piecewise smooth on [a, b]. Although the derivative of a smooth 
function is only defined and continuous on the open interval (a, b), and hence possibly is 
unbounded, it follows from part (d) of [link] that this derivative is improperly-integrable on that 
open interval. We recall also that just because a function is improperly-integrable on an open 
interval, its absolute value may not be improperly-integrable. Before giving the formal 
definition of a smooth curve, which apparently will be related to smooth or piecewise smooth 
functions, it is prudent to present an approximation theorem about smooth functions. [link] 
asserts that every continuous function on a closed bounded interval is the uniform limit of a 
sequence of step functions. We give next a similar, but stronger, result about smooth functions. 
It asserts that a smooth function can be approximated “almost uniformly” by piecewise linear 
functions. 


Let f be a smooth function on a closed and bounded interval |a, b], and assume that | f’| is 
improperly-integrable on the open interval (a, b). Given an € > 0, there exists a piecewise 
linear function p for which 


1. |f(x) — p(x)| < € for all x € [a, 8). 
2. f° |f' (x) —p' (a) | de <e. 


That is, the functions f and p are close everywhere, and their derivatives are close on average in 
the sense that the integral of the absolute value of the difference of the derivatives is small. 


Because f is continuous on the compact set |a, 6], it is uniformly continuous. Hence, let 6 > 0 
be such that if x, y € [a,b], and |x — y| < 6, then | f(x) — f(y)| < €/2. 


Because | f’| is improperly-integrable on the open interval (a, b), we may use part (b) of [link] 
to find a 6’ > 0, which may also be chosen to be < 6, such that = f'|+ is \f’| < €/2, 


and we fix sucha 0’. 


Now, because f’ is uniformly continuous on the compact set [a + 6’, b — 6’], there exists an 

a > Osuch that | f’ (x) — f’ (y)| < €/4(b — a) if x and y belong to [a + 6’, b — 6’] and 

|z — y| < a. Choose a partition {4p < 21 < ... < pn} of [a, b] such that 

fo = a,2,=a+6',2,_, =b—-6',x, = b, anda; — 2;_, <min (6,a) for2<i<n-1. 
Define p to be the piecewise linear function on [a, 6] whose graph is the polygonal line joining 
the n + 1 points (a, f (x1)),{(ai, f (w:))} for 1 <i <n -—1, and (6, f (ap_1)). That is, p is 
constant on the outer subintervals [a, 21] and [a,_1, b] determined by the partition, and its graph 
between xz, and z,,_; is the polygonal line joining the points 

{(@1, f (@1)), ---5 (@n_1, f (Ln_1))}. For example, for 2 < i <n —1, the function p has the 
form 

Equation: 


on the interval [#;_1, x;]. So, p(a) lies between the numbers f(z;-1) and f(;) for all 7. 
Therefore, 
Equation: 


If (2) — p(2)| < If (a) — f (@aI+1F (ei) — U(@)s If @) — Fad i+ If (ai) — f(tia)l< e. 


Since this inequality holds for all 2, part (1) is proved. 

Next, for 2 <i <m-—1, and for each x € (2;_1, 2;), we have 

p' (x) = (f (ai) — f (xi_1))/ (ai — 2-1), which, by the Mean Value Theorem, is equal to 

f' (yi) for some y; € (a;-1, 2:). So, for each such x € (2-1, 2;), we have 

|f’ (x) — p’ (x)| =|f' («) — fF’ (y;)], and this is less than €/4(b — a), because |r—y;|< a. On 
the two outer intervals, p(a) is a constant, so that p’ (2) = 0. Hence, 


Equation: 
b n x; 
i \f'-2'| = >), lf’ —2'| 
a i=1 Y¥ Vi-1 
L1 n-1 b 
fre Slee fe 
a 1=2 Tn-1 


ie 
a 
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IA 
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The proof is now complete. 


REMARK It should be evident that the preceding theorem can easily be generalized to a 
piecewise smooth function f, i.e., a function that is continuous on [a, b], continuously 
differentiable on each subinterval (t;_1,t;) of a partition {to < ti < ... < tn}, and whose 
derivative f’ is absolutely integrable on (a, b). Indeed, just apply the theorem to each of the 
subintervals (t;_1,¢;), and then carefully piece together the piecewise linear functions on those 
subintervals. 


Now we are ready to define what a smooth curve is. 


By asmooth curve from a point z; to a different point z2 in the plane, we mean a set 

C' C C that is the range of a 1-1, smooth, function ¢ : [a, b] + C, where [a, 5] is a 
bounded closed interval in R, where z; = y (a) and z2 = ¢ (0), and satisfying y’ (t) 0 
for all t € (a, b). 


More generally, if y : [a,b] + R? is 1-1 and piecewise smooth on [a, bj, and if 
{to < ti <... < tn} isa partition of [a, b] such that y’ (t) 4 0 for all ¢ € (t;_1, ¢;), then 


the range C of ¢ is called a piecewise smooth curve from z; = ¢ (a) to z2 = y (0b). 
In either of these cases, y is called a parameterization of the curve C’. 


Note that we do not assume that |y’| is improperly-integrable, though the preceding theorem 
might have made you think we would. 


REMARK Throughout this chapter we will be continually faced with the fact that a given curve 
can have many different parameterizations. Indeed, if y; : [a,b] —> C is a parameterization, and 
if g : [c,d] — [a, b] is a smooth function having a nonzero derivative, then 2 (s) = y1 (g(s)) 
is another parameterization of C’. Since our definitions and proofs about curves often involve a 
parametrization, we will frequently need to prove that the results we obtain are independent of 
the parameterization. The next theorem will help; it shows that any two parameterizations of C’ 
are connected exactly as above, i.e., there always is such a function g relating y1 and yo. 


Let v1 : [a,b] > Cand ¢2 : |c, d] + C be two parameterizations of a piecewise smooth curve 
C joining z1 to z2. Then there exists a piecewise smooth function g : |[c, d] — [a, b] such that 
y2 (s) = ¥1 (g(s)) for all s € [c, d]. Moreover, the derivative g’ of g is nonzero for all but a 
finite number of points in [c, d]. 


Because both y, and ¢g2 are continuous and 1-1, it follows from [link] that the function 

9 = £1. © $2 is continuous and 1-1 from [c, d] onto [a, 6]. Moreover, from [link], it must also 
be that g is strictly increasing or strictly decreasing. Write 

g1 (t) = ur (¢) + tv1 (t) = (u(t), v1 (¢)), and Yo (8) = ue (s) + ive (s) = (ua (s), v2 (s)). 
Let {zp < 21 <... < xp} bea partition of [a, b| for which ¢{ is continuous and nonzero on the 
subintervals (a;-1,2,;), and let {yo < yi < ... < yg} be a partition of [c, d] for which y% is 
continuous and nonzero on the subintervals (yz_1, yz). Then let {so < $1 <... < 8,} be the 
partition of [c, d] determined by the finitely many points {y;,} U {g~* (x;)}. We will show that 
g is continuously differentiable at each point s in the subintervals (s;_1, s;). 


Fix an s in one of the intervals (s;_1, s;), and let t = yy" (yo (s)) = g(s). Of course this 
means that y (t) = 2 (s), or ui (t) = ue (s) and vj (t) = v2 (s). Then ¢ is in some one of 
the intervals (7;_1, x;), so that we know that { (t) 4 0. Therefore, we must have that at least 
one of wu} (t) or v{ (t) is nonzero. Suppose it is v‘, (¢) that is nonzero. The argument, in case it is 
u; (t) that is nonzero, is completely analogous. Now, because v;, is continuous at t and 

v}, (t) 4 0, it follows that v is strictly monotonic in some neighborhood (t — 6,t + 6) of t and 
therefore is 1-1 on that interval. Then wa is continuous by [link], and is differentiable at the 
point v, (t) by the Inverse Function Theorem. We will show that on this small interval 

g= iG © Ug, and this will prove that g is continuously differentiable at s. 


Note first that if y2 (0) = x + iy is a point on the curve C, then v2 (y5* (x + iy)) = y. Then, 
for any T € [a, b], we have 
Equation: 


showing that Vv, 0V’2 = gi = g. Hence g is continuously differentiable at every point s in 
the subintervals (s;_1, s;). Indeed g’ (a) = vy (v2 (7))v5 (a) for all o near s, and hence g is 
piecewise smooth. 


Obviously, ye (s) = 91 (g(s)) for all s, implying that v5 (s) = vy} (g(s))g’ (s). Since 
~> (s) # 0 for all but a finite number ae points s, it follows that g’ (s) # 0 for all but a finite 


ae of points, and the theorem is proved. 


Let y, and ¢% be as in the theorem. Then, for all but a finite number of points 
z= 1 (t) = 2 (s) on the curve C, we have 
Equation: 


From the theorem we have that 
Equation: 


5 (8) = 91 (9(s))9' (s) = ¢) (9's) 


for all but a finite number of points s € (c,d). Also, g is strictly increasing, so that g' (s) > 0 
for all points s where g is differentiable. And in fact, g’ (s) 4 0 for all but a finite number of s's, 


. F we ! - / 
because g’ (s) is either (v;' 0 vg) (s) or (uy! o ug) (s), and these are nonzero except for a 
finite number of points. Now the corollary follows by direct substitution. 


REMARK If we think of y’ (t) = (2’ (t), y’ (t)) as a vector in the plane R?, then the corollary 
asserts that the direction of this vector is independent of the parameterization, at least at all but a 
finite number of points. This direction vector will come up again as the unit tangent of the 
curve. 


The adjective “smooth” is meant to suggest that the curve is bending in some reasonable way, 
and specifically it should mean that the curve has a tangent, or tangential direction, at each 
point. We give the definition of tangential direction below, but we note that in the context of a 
moving particle, the tangential direction is that direction in which the particle would continue to 
move if the force that is keeping it on the curve were totally removed. If the derivative 

y’ (t) # 0, then this vector is the velocity vector, and its direction is exactly what we should 
mean by the tangential direction. 


The adjective “piecewise” will allow us to consider curves that have a finite number of points 
where there is no tangential direction, e.g., where there are “corners.” 


We are carefully orienting our curves at the moment. A curve C’' from z to 2 is being 
distinguished from the same curve from Z2 to z1, even though the set C is the same in both 
instances. Which way we traverse a curve will be of great importance at the end of this chapter, 
when we come to Green's Theorem. 


Let C, the range of y : [a,b] —> C, be a piecewise smooth curve, and let 

z = (x,y) = (c) bea point on the curve. We say that the curve C has a tangential 
direction at z, relative to the parameterization y, if the following limit exists: 
Equation: 


If this limit exists, it is a vector of length 1 in R?, and this unit vector is called the unit 
tangent (relative to the parameterization _) to C at z. 


The curve C has a unit tangent at the point z if there exists a parameterization y for which 
the unit tangent at z relative to y exists. 


Exercise: 


Problem: 


a. Restate the definition of tangential direction and unit tangent using the R? version of 
the plane instead of the C' version. That is, restate the definition in terms of pairs 
(x, y) of real numbers instead of a complex number z. 

b. Suppose ¢ : [a,b] + Cis a parameterization of a piecewise smooth curve C,, and that 
t € (a,b) is a point where ¢ is differentiable with y’ (t) 4 0. Show that the unit 
tangent (relative to the parameterization y) to C' at z = y(t) exists and equals 
y’ (t)/ |’ (t)|. Conclude that, except possibly for a finite number of points, the unit 
tangent to C’ at z is independent of the parameterization. 

c. Let C be the graph of the function f(t) = |t| for ¢ € [—1, 1]. Is C a smooth curve? Is 
it a piecewise smooth curve? Does C' have a unit tangent at every point? 

d. Let C be the graph of the function f (t) = t?/? = (¢1/3)° fort € [-1,1].IsCa 
smooth curve? Is it a piecewise smooth curve? Does C’ have a unit tangent at every 
point? 

e. Consider the set C’ that is the right half of the unit circle in the plane. Let 
yi : [-1,1] > C be defined by 


Equation: 
v1 (t) = (cos (¢=), sin (‘= )), 
2 2 


and let y2 : [-1, 1] — C be defined by 


Equation: 


Ye (t) = (cos (5), sin (#5). 


Prove that 1 and wz are both parameterizations of C’. Discuss the existence of a unit 
tangent at the point (1,0) = v1 (0) = 2 (0) relative to these two parameterizations. 
f. Suppose y : [a,b] > C is a parameterization of a curve C from 2; to z2. Define w on 
[a, b| by Y(t) = y(a + b — t). Show that w is a parameterization of a curve from Zz» 

to 21. 


Exercise: 


Problem: 


a. Suppose f is a smooth, real-valued function defined on the closed interval [a, b|, and 
let C C R? be the graph of f. Show that C is a smooth curve, and find a “natural” 
parameterization y : [a, b] + C of C. What is the unit tangent to C at the point 
(t, f(t)? 

b. Let z; and zg be two distinct points in C,, and define ¢ : [0,1] > c by 
y (t) = (1 — t)z1 + tza. Show that ¢ is a parameterization of the straight line from 
the point z, to the point z2. Consequently, a straight line is a smooth curve. (Indeed, 
what is the definition of a straight line?) 


c. Define a function y : [—r,r] > R? by y(t) = («, Vr? — i). Show that the range 


C of y is asmooth curve, and that y is a parameterization of C’. 

d. Define y on [0, 7/2) by y (t) = e”. For what curve is y a parametrization? 

e. Let 21, 22, ..., Z, be n distinct points in the plane, and suppose that the polygonal line 
joing these points in order never crosses itself. Construct a parameterization of that 
polygonal line. 

. Let S be a piecewise smooth geometric set determined by the interval [a, b] and the 
two piecewise smooth bounding functions wu and l. Suppose z; and 2» are two points 
in the interior S° of S. Show that there exists a piecewise smooth curve C joining z1 


oar) 


to 22, i.e., a piecewise smooth function y : fa, b| — Cwith y (a) = z, and 
7) (é) = zg, that lies entirely in S°. 


g. Let C be a piecewise smooth curve, and suppose ¢ : [a,b] —> C is a parameterization 
of C’.. Let |c, d] be a subinterval of [a, 6]. Show that the range of the restriction of y to 
[c, d| is a smooth curve. 


Exercise: 


Problem: Suppose C is a smooth curve, parameterized by y = u + iv: [a,b] > C. 


a. Suppose that wu’ (t) 4 0 for all t € (a, b). Prove that there exists a smooth, real- 
valued function f on some closed interval [a’, b’] such that C' coincides with the graph 


of f. HINT: f should be something like v o u7!. 
b. What if vu’ (t) £ 0 for all t € (a, b)? 


Exercise: 


Problem: 


Let C be the curve that is the range of the function y : [—1, 1] + C, where 
y(t) = t? + t%). 
a. Is C' a piecewise smooth curve? Is it a smooth curve? What points z, and z» does it 
join? 
b. Is y a parameterization of C’? 


c. Find a parameterization for C' by a function w : [3, 4] > C. 
d. Find the unit tangent to C’ and the point 0 + 02. 


Exercise: 
Problem: 


Let C be the curve parameterized by y : [—7, 7 — e| + C defined by 
y (t) =e” =cos (t) +7 sin (t). 


a. What curve does y parameterize? 
b. Find another parameterization of this curve, but base on the interval [0,1 — €]. 


Arc Length 
A formula for calculating arc length, an exercise exploring the possibility of infinite 
length, and some other related theorems, remarks, and exercises. 


Suppose C’ is a piecewise smooth curve, parameterized by a function y. Continuing to 
think like a physicist, we might guess that the length of this curve could be computed as 
follows. The particle is moving with velocity y’ (t). This velocity is thought of as a vector 
in R?, and as such it has a direction and a magnitude or speed. The speed is just the 
absolute value |y’ (t)| of the velocity vector y’ (t). Now distance is speed multiplied by 
time, and so a good guess for the formula for the length L of the curve C' would be 
Equation: 


b 
L= / lp’ (t)| at. 


Two questions immediately present themselves. First, and of primary interest, is whether 
the function |y’| is improperly-integrable on (a, b)? We know by [link] that y’ itself is 
improperly-integrable, but we also know from [link] that a function can be improperly- 
integrable on an open interval and yet its absolute value is not. In fact, the answer to this 
first question is no (See [link].). We know only that |y’| exists and is continuous on the 
open subintervals of a partition of |a, 6]. 


The second question is more subtle. What if we parameterize a curve in two different 
ways, i.e., with two different functions y~; and y2? How do we know that the two integral 
formulas for the length have to agree? Of course, maybe most important of all to us, we 
also must justify the physicist's intuition. That is, we must give a rigorous mathematical 
definition of the length of a smooth curve and show that Formula ([link]) above does in 
fact give the length of the curve. First we deal with the independence of parameterization 
question. 


Let C' be a smooth curve joining (distinct) points z; to zg in C, and let v1 : [a,b] > C 
and zp : [c, d] + C be two parameterizations of C. Suppose |y,| is improperly-integrable 
on (c, d). Then |v} is improperly-integrable on (a,b), and 


Equation: 
b d 
/ / 
[leet a= [Veil as 


We will use [link]. Thus, let g = y,' © @o, and recall that g is continuous on [c, d] and 
continuously differentiable on each open subinterval of a certain partition of [c, d]. 
Therefore, by part (d) of [link], g’ is improperly-integrable on (c, d). 


Let {a9 < 41 <... < zp} bea partition of [a, b] for which y{ is continuous and nonzero 
on the subintervals (x;_1,2,;). To show that |y}| is improperly-integrable on (a, 6), it will 
suffice to show this integrability on each subinterval (x ;_1, x;). Thus, fix a closed interval 
[a’, b'| C (5-1, xj), and let [c’, d’] be the closed subinterval of [c, d] such that g maps 

[c’, d’| 1-1 and onto |a’, b’|. Hence, by part (e) of [link], we have 


Equation: 
bv! 
[ Whole = 


I 


which, by taking limits as a’ goes to z;_, and b’ goes to x, shows that |(p;| is improperly- 
integrable over (x;-1, 2;) for every j, and hence integrable over all of (a, b). Using part 
(e) of [link] again, and a calculation similar to the one above, we deduce the equality 


Equation: 
b d 
/ / 
/ P1 |= / |y2 F 
a c 


and the theorem is proved. 
Exercise: 
A curve of infinite length 


Problem: 


Let y : [0,1] : R? be defined by (0) = (0,0), and for t > 0, 
y(t) = (t,t sin (1/t)). Let C be the smooth curve that is the range of yp. 


a. Graph this curve. 
b. Show that 


Equation: 


|e’ (é)| = 4/14 sin’ (1/t) 


sin (2/t) : cos? (1/t) 
t t? 


#2 + # sin? (1/t) —t sin (2/t)+ cos? (1/t) 


c. Show that 
Equation: 
: 54 [4 — sin? (t) sin (2t) 
/ 
tl dt= —4/ — + — ——— + cos? (t) dt. 
[wola=f are ; ( 
d. Show that there exists an € > 0 so that for each positive integer n we have 


cos” (t)— sin (2t)/t > 1/2 for all t such that |t — nz| < e. 

. Conclude that |y’| is not improperly-integrable on (0, 1). Deduce that, if 
Formula ({link]) is correct for the length of a curve, then this curve has infinite 
length. 


oO 


Next we develop a definition of the length of a parameterized curve from a purely 
mathematical or geometric point of view. Happily, it will turn out to coincide with the 
physically intuitive definition discussed above. 


Let C be a piecewise smooth curve joining the points z; and zg, and let y : [a,b] — C be 
a parameterization of C. Let P = {a = tp < ty <... < tn = b} bea partition of the 
interval [a, 6]. For each 0 < j < n write z; = y (t;), and think about the polygonal 
trajectory joining these points {z,} in order. The length L¥, of this polygonal trajectory is 
given by the formula 

Equation: 


n 

=) 

LS = |2; — 23-1, 
j=l 


and this length is evidently an approximation to the length of the curve C’. Indeed, since 
the straight line joining two points is the shortest curve joining those points, these 
polygonal trajectories all should have a length smaller than or equal to the length of the 
curve. These remarks motivate the following definition. 


Let y : [a,b] > C be a parameterization of a piecewise smooth curve CC C. By the 
lengthL*” of C, relative to the parameterization y, we mean the number 


L? =supp L*, where the supremum is taken over all partitions P of [a, b]. 


Of course, the supremum in the definition above could well equal infinity in some cases. 
Though it is possible for a curve to have an infinite length, the ones we will study here will 
have finite lengths. This is another subtlety of this subject. After all, every smooth curve is 
a compact subset of R?, since it is the continuous image of a closed and bounded interval, 
and we think of compact sets as being “finite” in various ways. However, this finiteness 
does not necessarily extend to the length of a curve. 

Exercise: 


Problem: 


Let y : [a,b] > R? be a parameterization of a piecewise smooth curve C, and let P 
and Q be two partitions of [a, 5]. 


a. If P is finer than Q, i.e., Q C P, show that Lo — De: 
b. If p(t) = u(t) + iv(t), express L¥ in terms of the numbers u(t;) and v(t;). 


Of course, we again face the annoying possibility that the definition of length of a curve 
will depend on the parameterization we are using. However, the next theorem, taken 
together with [link], will show that this is not the case. 


If C is a piecewise smooth curve parameterized by  : [a,b] + C, then 
Equation: 


b 
Le = [|p (| ae 


specifically meaning that one of these quantities is infinite if and only if the other one is 
infinite. 


We prove this theorem for the case when C’ is a smooth curve, leaving the general 
argument for a piecewise smooth curve to the exercises. We also only treat here the case 
when L* is finite, also leaving the argument for the infinite case to the exercises. Hence, 
assume that y = u + iv is a smooth function on [a, 6] and that L? < oo. 


Let € > 0 be given. Choose a partition P = {tp < t1 < ... < ty} of [a, b] for which 
Equation: 


L? -L2=L° — J |p (ts) — e (ty)| <e. 
j=1 


Because y is continuous, we may assume by making a finer partition if necessary that the 
t;'s are such that |y (t1) — y (to)|< € and |y (tr) — y (tn_1)|< €. This means that 
Equation: 


n—-1 
L° ~SIp (ti) — 9 (ty) <3e. 
j=2 


The point of this step (trick) is that we know that y’ is continuous on the open interval 
(a, b), but we will use that it is uniformly continuous on the compact set [¢1, tn_1]. Of 
course that means that |y’| is integrable on that closed interval, and in fact one of the 
things we need to prove is that |’| is improperly-integrable on the open interval (a, b). 


Now, because y’ is uniformly continuous on the closed interval [t;,t,—1], there exists a 

5 > Osuch that |y’ (t) — y’ (s)| < € if |t — s| < dandt and s are in the interval 

[t1, tn_1]. We may assume, again by taking a finer partition if necessary, that the mesh size 
of Pis less than this 6. Then, using part (f) of [link], we may also assume that the partition 
P is such that 


Equation: 
tn—1 n-1 
/ / 
[|e Ola Sle Cale — Hal <e 
ty j=2 


no matter what points s; in the interval (t;-1, t;) are chosen. So, we have the following 
calculation, in the middle of which we use the Mean Value Theorem on the two functions 
u and v. 

Equation: 


tn-1 
0< ue | 
ty 
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n—-1 tn—1 
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n—1 Gn-1 
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- frien 
Be 3 (u! (55))? + (ov (53)? (ty — ty) 

- flowin 

2 (u(83))? + (0 (my))? — yf (u(s))? + (w! (83))?| (t5 — ty) 
3e+| 3 ! (s))] (ty — ty) - / ” g(t) | ae 

Ss V (u(53))? + (0! rp)? = alsa)? + (&(8,))"] () — Ba) 
St le! (v3)? — et sot 


F Jiwtaa Hw + Vee wi wea? 


=I Jo! (rj) — 0" (84) [lo" (rg) + (5 ce 
ae > re (rg) FHI” (9) 


(t; — tj-1) 
te + 5 |e’ ( r;) —v' (s;)| (t; —t;-1) 


j=2 


ter Sle r;) 85)| (t; — t3-1) 


4e + ye (t; = t4) 
j=2 


4e + €(tr—1 _ ty) 


< &4+b-a). 


This implies that 
Equation: 


tn—-1 
LP e(4+b-a) < [ |p’| < L? +€(44+b-a). 


ty 


If we now let t; approach a and t,_1 approach b, we get 
Equation: 


b 
Le e(4+b-a) < [ ly'| < L? +e(4+6-a), 


which completes the proof, since € is arbitrary. 
Exercise: 


Problem: 


a. Take care of the piecewise case in the preceding theorem. 
b. Take care of the case when L”* is infinite in the preceding theorem. 


We now have all the ingredients necessary to define the length of a smooth curve. 


Let C' be a piecewise smooth curve in the plane. The length or arc lengthL = L(C) 
of C is defined by the formula 
Equation: 


L(C) = L* =uP Le. 


where y is any parameterization of C. 


If z and w are two points on a piecewise smooth curve C’, we will denote by L(z, w) 
the arc length of the portion of the curve between z and w. 


REMARK According to [link] and [link], we have the following formula for the length of 
a piecewise smooth curve: 
Equation: 


b 
L= / |y’ ()| dt, 


where y is any parameterization of C. 


It should come as no surprise that the length of a curve C’ from z, to z2 is the same as the 
length of that same curve C’, but thought of as joining z2 to z;. Nevertheless, let us make 
the calculation to verify this. If y : [a,b] + Cis a parameterization of this curve from 2, 
to zg, then we have seen in part (f) of exercise 6.1 that w : [a,b] — C, defined by 

v(t) = y(a+ b — £), is a parameterization of C' from Zz to z,. We just need to check that 
the two integrals giving the lengths are equal. Thus, 

Equation: 


i 


where the last equality follows by changing variables, i.e., setting t= a-+b-—s. 


b b 


wi (pia= [ 


a 


¢(a+b-#)(-1)/ae= | 


a 


b 
g(a+b-piat= f |v’ (s)| ds, 


We can now derive the formula for the circumference of a circle, which was one of our 
main goals. TRUMPETS? 


Let C' be a circle of radius r in the plane. Then the length of C is 2zr. 


Let the center of the circle be denoted by (h, &). We can parameterize the top half of the 
circle by the function ¢ on the interval [0,7] by y(t) = h+r cos (t) + i(k +, sin (t)). 
So, the length of this half circle is given by 

Equation: 


b= | 
0 


The same kind of calculation would show that the lower half of the circle has length zr, 
and hence the total length is 27r. 


Tv 


e(piae= f iain tee war =f Hate 
0 0 


The integral formula for the length of a curve is frequently not much help, especially if 
you really want to know how long a curve is. The integrals that show up are frequently not 
easy to work out. 

Exercise: 


Problem: 


a. Let C be the portion of the graph of the function y = x? between x = 0 and 
xz =1.Let y: [0,1] — Cbe the parameterization of this curve given by 
y (t) = t+ 7%. Find the length of this curve. 

b. Define y : [—0, 7] + C by y(t) = a cos (t) + 2b sin (¢). What curve does y 
parameterize, and can you find its length? 


Integration with Respect to Arc Length 

We introduce next what would appear to be the best parameterization of a 
piecewise smooth curve, i.e., a parameterization by arc length. We will then 
use this parameterization to define the integral of a function whose domain 
is the curve. 


We introduce next what would appear to be the best parameterization of a 
piecewise smooth curve, i.e., a parameterization by arc length. We will then 
use this parameterization to define the integral of a function whose domain 
is the curve. 


Let C be a piecewise smooth curve of finite length L joining two distinct 
points z; to 22. Then there exists a parameterization y : [0, L] + C for 
which the arc length of the curve joining y(t) to y(w) is equal to |u — ¢| for 
allt <u é (0, L}. 


Let y : [a,b] + C be a parameterization of C’. Define a function 
F : [a,b] — [0, LZ] by 
Equation: 


F=f |e (s)| ds, 


In other words, F'(t) is the length of the portion of C that joins the points 
z1 = y (a) and y(t). By the Fundamental Theorem of Calculus, we know 
that the function F’ is continuous on the entire interval [a, b] and is 
continuously differentiable on every subinterval (¢;_1, ¢;) of the partition P 
determined by the piecewise smooth parameterization y. Moreover, 

F' (t) = |y’ (t)| > 0 forall t € (¢;_1,t;), implying that F is strictly 
increasing on these subintervals. Therefore, if we write s; = F (t;), then 
the s;'s form a partition of the interval [0, Z|, and the function 

F : (t;-1,t;) + (s;-1, $;) is invertible, and its inverse F~! is continuously 
differentiable. It follows then that y = yo F~!: [0, L] > Cisa 
parameterization of C’. The arc length between the points y(t) and y(w) is 
the arc length between y(F'~' (t)) and y(F' (u)), and this is given by 
the formula 


Equation: 


Fu) F~(u) F-1(t) 
i. / ? (s)|as— f |v" (s)| ds 
= F(F"(u)) -F(F'()) 


= u-t, 


6 
— 
J 
S. 
Vs) 
| 


which completes the proof. 


If 7 is the parameterization by arc length of the preceding theorem, then, for 
all t € (s;-1, $;), we have |’ (s)| = 1. 


We just compute 


Equation: 
ly (s)| = |(poF*)'(s)| 
= |p’ (F1(s))(F*)'(s)| 
! —1 1 
- ’ a (ll Eras) 
! —1 1 
ere 0) ee 
ae 
as desired. 


We are now ready to make the first of our three definitions of integral over 
a curve. This first one is pretty easy. 


Suppose C is a piecewise smooth curve joining z1 to z2 of finite length L, 
parameterized by arc length. Recall that this means that there is a 1-1 
function from the interval {0, Z] onto C that satisfies the condidition that 
the arc length betweenthe two points y(t) and y(s) is exactly the distance 


between the points t and s. We can just identify the curve C’ with the 
interval [0, Z|, and relative distances will correspond perfectly. A partition 
of the curve C' will correspond naturally to a partition of the interval [0, LZ). 
A step function on the dcurve will correspond in an obvious way to a step 
function on the interval [0, Z|, and the formula for the integral of a step 
function on the curve is analogous to what it is on the interval. Here are the 
formal definitions: 


Let C’ be a piecewise smooth curve of finite length L joining distinct 
points, and let + : [0, L] + C be a parameterization of C' by arc 
length. By a partition of C' we mean a set {Zo, 21, ---; Zn} of points on 
C such that z; = y(t;) for all 7, where the points 

{to < t1 <... < t,} form a partition of the interval {0, Z|. The 
portions of the curve between the points z;_1 and z,, i.e., the set 
7(t;-1, t;), are called the elements of the partition. 


A step fucntion on C is a real-valued function h on C’ for which there 
exists a partition {29, 21, ..., Zn} of C’ such that h(z) is a constant a, 


on the portion of the curve between z;_1 and z;. 


Before defining the integral of a step function on a curve, we need to 
establish the usual consistency result, encountered in the previous cases of 
integration on intervals and integration over geometric sets, the proof of 
which this time we put in an exercise. 

Exercise: 


Problem: 


Suppose A is a function on a piecewise smooth curve of finite length 
L, and assume that there exist two partitions {z9, 21, ...,Z,} and 
{wWo, W1,---) Wm} of C such that h(z) is a constant a, on the portion 
of the curve between zz_1 and z,, and h(z) is a constant b; on the 
portion of the curve between w,_; and w;. Show that 

Equation: 


m 


So an (2¢-1; 2k) = > B/D (wz-1, wy). 
k=1 


j=l 


HINT: Make use of the fact that h o y is a step function on the interval 
[0, L}. 


Now we can make the definition of the integral of a step function on a 
curve. 


Let h be a step function on a piecewise smooth curve C of finite 
length L. The integral, with respect to arc length of h over C’ is 
denoted by [., h(s) ds, and is defined by 


Equation: 


| h(s)ds = > @3 1D ( 25-1523); 
Cc j=l 


where { Zo, 21, ---) 2n} is a partition of C for which h(z) is the constant 
a, on the portion of C’ between z,_, and z;. 


Of course, integrable functions on C’ with respect to arc length will be 
defined to be functions that are uniform limits of step functions. Again, 
there is the consistency issue in the definition of the integral of an 
integrable function. 

Exercise: 


Problem: 


a. Suppose {h,,} is a sequence of step functions on a piecewise 
smooth curve C' of finite length, and assume that the sequence 
{hy} converges uniformly to a function f. Prove that the 
sequence { if aia (8) ds} is a convergent sequence of real 
numbers. 

b. Suppose {h,,} and {k,,} are two sequences of step functions on a 
piecewise smooth curve C’ of finite length /, and that both 
sequences converge uniformly to the same function f. Prove that 
Equation: 


im | hy (s) ds =tim | kn (s) ds. 
C CG 


Let C be a piecewise smooth curve of finite length L. A function f 
with domain C is called integrable with respect to arc length on C if it 
is the uniform limit of step functions on C. 


The integral with respect to arc length of an integrable function f on 
C is again denoted by {/, f (s) ds, and is defined by 
Equation: 


[ Cree [ more 


where {h,,} is a sequence of step functions that converges uniformly 


to fond. 


In a sense, we are simply identifying the curve C with the interval [0, LZ] by 
means of the 1-1 parameterizing function y. The next theorem makes this 
quite plain. 


Let C be a piecewise smooth curve of finite length L, and let y be a 
parameterization of C’ by arc length. If f is an integrable function on C, 
then 

Equation: 


[soe=f rowee 


First, if h is a step function on C, let {z;} be a partition of C’ for which 
h(z) is a constant a; on the portion of the curve between z;_; and z;. Let 
{t;} be the partition of [0, L] for which z; = y(t,;) for every 7. Note that 


ho -vyisastep function on [0, LZ], and that h o y(t) =a, for all 
te (ages): Then, 


Equation: 
/ h(s)ds = 
C 


S| a; (tj — tj-1) 
j=1 


L 
/ ho y(t) dt, 
0 


which proves the theorem for step functions. 
Finally, if f =lim h, is an integrable function on C’, then the sequence 


{h,, © y} converges uniformly to f o yon [0, LZ], and so 
Equation: 


[teas = tim | hn (8) ds 
L 
lim / hn (y (t)) dt 


l 
= / f (y(t) at, 


where the final equality follows from [link]. Hence, [link] is proved. 


Although the basic definitions of integrable and integral, with respect to arc 
length, are made in terms of the particular parameterization 7 of the curve, 


for computational purposes we need to know how to evaluate these 
integrals using different parameterizations. Here is the result: 


Let C’ be a piecewise smooth curve of finite length L, and let 

y : [a,b] > C’be a parameterization of C. If f is an integrable function on 
C’. Then 

Equation: 


b 
/ f(s) ds = / f (v(t) |v (lat. 
C a 


Write y : [0, L] > C for a parameterization of C' by arc length. As in the 
proof to [link], we write g : [a,b] —> [0, L] for y~' o . Just as in that 
proof, we know that g is a piecewise smooth function on the interval [a, 6]. 
Hence, recalling that |-y’ (t)| = 1 and g’ (t) > 0 for all but a finite number 
of points, the following calculation is justified: 

Equation: 


[ teas 


L 
/ f (y(t) at 
L 


f(y (t)) ly’ | at 


b 


f (y(9(u))) |y' 9 (u)) [9° (u) du 


(y(9(u)))Iy' (9 (u))Il9" (u)| du 


| 
, 


| 
os 
6 
S 
=e 
& 
. 
S 
S 
& 


_ Fe) |(‘gamma o g)' (u)| du 


b 


| 
, 


(y (u)) |v’ (u)| du, 


as desired. 
Exercise: 


Problem: 


Let C be the straight line joining the points (0, 1) and (1, 2). 


a. Find the arc length parameterization ¥ : 0, J 2] => C. 


b. Let f be the cae on this curve given by f (x, y) = xy. 
Compute [., f (s) ds. 

c. Let f be the eee on this curve that is defined by f(z, y) is 
the distance from (x, y) to the point (0,3). Compute f, f (s) ds 


The final theorem of this section sums up the properties of integrals with 
respect to arc length. There are no surprises here. 


Let C be a piecewise smooth curve of finite length L, and write I(C) for 
the set of all functions that are integrable with respect to arc length on C. 
Then: 


1. I(C) is a vector space ovr the real numbers, and 
Equation: 


[ars +09 (s))as=af F(s)as+b [ 9(s)as 


for all f,g € I(C) andalla,be R. 

2. (Positivity) If f(z) > 0 for all z € C, then J, f (s) ds > 0. 

3. If f € I(C), then so is |f|, and | f., f ( eat )| ds. 

4. If f is the uniform limit yi ae - each of which is in I(C), 
then f € I(C) and [,, f (s) ds =lim fa fn (s) ds 

5. Let {un } - a ae os ee in I(C), and suppose that for 
each n there is a number m,,, for which |u,, (z)| < my for all z € C, 
and such that the infinite series 5) m,, converges. Then the infinite 


series } wy converges uniformly to an integrable function, and 


lad ta (6) de= >, [ota (a) ds. 


Exercise: 


Problem: 


a. Prove the preceding theorem. Everything is easy if we compose 
all functions on C’ with the parameterization y, obtaining 
functions on [0, D], and then use [link]. 

b. Suppose C is a piecewise smooth curve of finite length joining z; 
and z2. Show that the integral with respect to arc length of a 
function f over C’ is the same whether we think of C' as being a 
curve from z1 to 22 or, the other way around, a curve from 22 to 
Z1- 


REMARK Because of the result in part (b) of the preceding exercise, we 
speak of “integrating over C” when we are integrating with respect to arc 
length. We do not speak of “integrating from z; to Z2,” since the direction 
doesn't matter. This is in marked contrast to the next two kinds of integrals 
over curves that we will discuss. 


here is one final bit of notation. Often, the curves of interest to us are graphs 
of real-valued functions. If g : [a,b] + R is a piecewise smooth function, 
then its graph C’ is a piecewise smooth curve, and we write 

oer f (s) ds for the integral with respect to arc length of f over 


C = graph(q). 


Contour Integrals 

We discuss next what appears to be a simpler notion of integral over a 
curve. In this one, we really do regard the curve C’ as a subset of the 
complex plane as opposed to two-dimensional real space; we will be 
integrating complex-valued functions; and we explicitly think of the 
parameterizations of the curve as complex-valued functions on an interval 
[a, b]. Also, in this definition, a curve C’ from 2; to z will be distinguished 
from its reverse, i.e., the same set C’ thought of as a curve from 22 to 2. 


We discuss next what appears to be a simpler notion of integral over a 
curve. In this one, we really do regard the curve C’ as a subset of the 
complex plane as opposed to two-dimensional real space; we will be 
integrating complex-valued functions; and we explicitly think of the 
parameterizations of the curve as complex-valued functions on an interval 
[a, b]. Also, in this definition, a curve C from zj to z2 will be distinguished 
from its reverse, i.e., the same set C’ thought of as a curve from 22 to 2}. 


Let C’' be a piecewise smooth curve from 2, to z2 in the plane C, 
parameterized by a (complex-valued) function y : [a,b] > C. If fisa 
continuous, complex-valued function on C’, The contour integral of f 
from z to z2 along C' will be denoted by [ o f (¢) d¢ or more precisely 
by f Cx, f (©) dé, and is defindd by 


Equation: 


Z9 b 
FQa= | Helt)e' Oa. 
Czy a 


REMARK There is, as usual, the question about whether this definition 
depends on the parameterization. Again, it does not. See the next exercise. 


The definition of a contour integral looks very like a change of variables 
formula for integrals. See [link] and part (e) of [link]. This is an example of 
how mathematicians often use a true formula from one context to make a 
new definition in another context. 


Notice that the only difference between the computation of a contour 
integral and an integral with respect to arc length on the curve is the 
absence of the absolute value bars around the factor y’ (t). This will make 
contour integrals more subtle than integrals with respect to arc length, just 
as conditionally convergent infinite series are more subtle than absolutely 
convergent ones. 


Note also that there is no question about the integrability of f (y (t))y’ (t), 
because of [link]. f is bounded, vy’ is improperly-integrable on (a, b), and 
therefore so is their product. 

Exercise: 


Problem: 


a. State and prove the “independence of parameterization” result for 
contour integrals. 

b. Prove that 
Equation: 


“s(Qat=-[ F(Oae. 


Czy C22 


Just remember how to parameterize the curve in the opposite 
direction. 

c. Establish the following relation between the absolute value of a 
contour integral and a corresponding integral with respect to arc 
length. 

Equation: 


[r@ais [ira 


Not all the usual properties hold for contour integrals, e.g., like those in 
[link] above. The functions here, and the values of their contour integrals, 
are complex numbers, so all the properties of integrals having to do with 
positivity and inequalities, except for the one in part (c) of [link], no longer 


make any sense. However, we do have the following results for contour 
integrals, the verification of which is just as it was for [link]. 


Let C’ be a piecewise smooth curve of finite length joining z; to z2. Then 
the contour integrals of continuous functions on C’ have the following 
properties. 


1. If f and g are any two continuous functions on C’, and a and 6 are any 
two complex numbers, then 
Equation: 


[ar +ra@ya=af soac+o f aac, 


2. If f is the uniform limit on C of a sequence { f,,} of continuous 
functions, then JV, f (¢) d¢ =lim Ju fn (¢) dé. 

3. Let {u,,} be a sequence of continuous functions on C’,, and suppose 
that for each n there is a number m,, for which |u,, (z)| < my, for all 
z € C, and such that the infinite series Sm, converges. Then the 
infinite series 5° u,, converges uniformly to a continuous function, and 


fo Nitin (Q dl =¥ foun (0) de. 


In the next exercise, we give some important contour integrals, which will 
be referred to several times in the sequel. Make sure you understand them. 
Exercise: 


Problem: 


Let c be a point in the complex plane, and let r be a positive number. 
Let C be the curve parameterized by y : [—7, 7 — €] : C defined by 
y(t) =c+re” =c+r cos (t) + ir sin (t). For each integer 

n € Z, define f, (z) = (z—c)”. 


1. What two points z; and z2 does C’ join, and what happens to z2 
as € approaches 0? 

2. Compute fy fn (¢) d¢ for all integers n, positive and negative. 

3. What happens to the integrals computed in part (b) when € 
approaches 0? 


4. Set € = m, and compute J, fn (¢) d¢ for all integers n. 
5. Again, set € = 7. Evaluate 
Equation: 


cos (¢ — c) sin (¢ — c) 
ae Ee —~—* dé. 
[ fax ac and | ce ¢ 


HINT: Make use of the infinite series representations of the 
trigonometric functions. 


Vector Fields, Differential Forms, and Line Integrals 

We motivate our third definition of an integral over a curve by returning to physics. This 
definition is very much a real variable one, so that we think of the plane as R? instead of C. 
A connection between this real variable definition and the complex variable definition of a 
contour integral will emerge later. 


We motivate our third definition of an integral over a curve by returning to physics. This 
definition is very much a real variable one, so that we think of the plane as R? instead of C. 
A connection between this real variable definition and the complex variable definition of a 
contour integral will emerge later. 


By a vector field on an open subset U of R*, we mean nothing more than a continuous 
> 
function V (x, y) = (P (2, y), Q (x, y)) from U into R?. The functions P and Q are 


—> 
called the components of the vector field V. 


—_ 
We will also speak of smooth vector fields, by which we will mean vector fields V both 
of whose component functions P and Q have continuous partial derivatives 
Equation: 
tialP tialP tialQ ed tialQ 
tialx’ tialy’ tialx tialy 


on U. 


The idea from physics is to think of a vector field as a force field, i.e., something that exerts 


> 
a force at the point (x, y) with magnitude |V (2, y)| and acting in the direction of the vector 


Ve y). For a particle to move within a force field, “work” must be done, that is energy 
must be provided to move the particle against the force, or energy is given to the particle as 
it moves under the influence of the force field. In either case, the basic definition of work is 
the product of force and distance traveled. More precisely, if a particle is moving in a 


direction u within a force field, then the work done on the particle is the product of the 

component of the force field in the direction of u and the distance traveled by the particle in 
—_ 

that direction. That is, we must compute dot products of the vectors V (z, y) and u (z, y). 


Therefore, if a particle is moving along a curve C’, parameterized with respect to arc length 
by y: [0, L] > C, and we write y(t) = (x(t), y(t)), then the work W(z1, z2) done on the 


+ 
particle as it moves from z; = y (0) to zz = y (ZL) within the force field V, should 
intuitively be given by the formula 
Equation: 


Waa) = f V (v(t) 17!) pa 
P(a(t),y( 


-f[ 


/ Pdz+Qdy, 
C 


t))x' (t) + Q (a (t),y(4))y' (4) dt 


II 


where the last expression is explicitly defining the shorthand notation we will be using. 


The preceding discussion leads us to a new notion of what kind of object should be 
“integrated” over a curve. 


A differential form on a subset U of R? is denoted by w = Pdx + Qdy, and is 
determined by two continuous real-valued functions P and Q on U. We say that w is 
bounded or uniformly continuous if the functions P and Q are bounded or uniformly 
continuous functions on U. We say that the differential form w is smooth of order k if 
the set U is open, and the functions P and Q have continuous mixed partial derivatives 
of order k. 


If w = Pdz + Qdy is a differential form on a set U, and if C’ is any piecewise smooth 
curve of finite length contained in U, then we define the line integral Al cw of w over C' 
by 

Equation: 


L 
[w= [paerad=f Powe’) + ery Wat 
C C 0 


where y(t) = (a(t), y(t)) is a parameterization of C by arc length. 


REMARK There is no doubt that the integral in this definition exists, because P and Q are 
continuous functions on the compact set C’, hence bounded, and ¥’ is integrable, implying 
that both x’ and y’ are integrable. Therefore P (y (t))ax’ (t) + Q (7 (#))y’ (€) is integrable 
on (0, L). 


These differential forms w really should be called “differential 1-forms.” For instance, an 
example of a differential 2-form would look like R dxdy, and in higher dimensions, we 
could introduce notions of differential forms of higher and higher orders, e.g., in 3 
dimension things like Pdrdy + Q dzdy + Rdzdz. Because we will always be dealing 
with R?, we will have no need for higher order differential forms, but the study of such 
things is wonderful. Take a course in Differential Geometry! 


Again, we must see how this quantity f cw depends, if it does, on different 
parameterizations. As usual, it does not. 
Exercise: 


Problem: Suppose w = Pdx + Qdy is a differential form ona subset U of R?. 


a. Let C’ be a piecewise smooth curve of finite length contained in U that joins z to 
22. Prove that 
Equation: 


b 
[oq [parod= f P(y(t))a' (t) + Q(y (t))y' (t) at 


for any parameterization y : [a,b] + C having components x(t) and y(t). 


b. Let C be as in part (a), and let C denote the reverse of C, i.e., the same set C’ but 
thought of as a curve joining z2 to z;. Show that iP gS i Ww. 

c. Let C be as in part (a). Prove that 
Equation: 


if Pdz+Qdy| < (Mp + Ma)L, 
Cc 


where Mp and Mg are bounds for the continuous functions |P| and |Q| on the 
compact set C’, and where L is the length of C’. 


Example: 

The simplest interesting example of a differential form is constructed as follows. Suppose 
U is an open subset of R?, and let f : U > Rbea differentiable real-valued function of 
two real variables; i.e., both of its partial derivatives exist at every point (x, y) € U. (See 
the last section of Chapter IV.) Define a differential form w = df, called the differential of 
f, by 

Equation: 


tial tial 
df = celle eae ae 
tialx tialy 


ie., P = tialf/tialxz and Q = tialf /tialy. These differential forms df are called exact 
differential forms. 


REMARK Not every differential form w is exact, i.e., of the form df. Indeed, determining 
which w's are df's boils down to what may be the simplest possible partial differential 
equation problem. If w is given by two functions P and Q, then saying that w = df 
amounts to saying that f is a solution of the pair of simultaneous partial differential 
equations 

Equation: 


oad — P and aly _ 
tialx tialy 


See part (b) of the exercise below for an example of a nonexact differential form. 


Of course if a real-valued function f has continuous partial derivatives of the second order, 
then [link] tells us that the mixed partials f,,, and fy, must be equal. So, if 

w = Pdxz + Qdy = df for some such f, Then P and Q would have to satisfy 

tialP/tialy = tialQ/tialx. Certainly not every P and Q would satisfy this equation, so it 
is in fact trivial to find examples of differential forms that are not differentials of functions. 
A good bit more subtle is the question of whether every differential form Pdx + Qdy, for 
which tialP/tialy = tialQ/tialz, is equal to some df. Even this is not true in general, as 
part (c) of the exercise below shows. The open subset U on which the differential form is 
defined plays a significant role, and, in fact, differential forms provide a way of studying 
topologically different kinds of open sets. 


In fact, although it may seem as if a differential form is really nothing more than a pair of 
functions, the concept of a differential form is in part a way of organizing our thoughts 
about partial differential equation problems into an abstract mathematical context. This 
abstraction is a good bit more enlightening in higher dimensional spaces, i.e., in connection 
with functions of more than two variables. Take a course in Multivariable Analysis! 
Exercise: 


Problem: 


a. Solve the pair of simultaneous partial differential equations 
Equation: 


tialf _ 
tialr 


tialf _ 


ee 
i a tialy 


xz y. 


b. Show that it is impossible to solve the pair of simultaneous partial differential 
equations 
Equation: 


tialf _ tialf 3 
tial — a a tialy — 


Hence, conclude that the differential form w = (x + y)dzx + y*dy is not the 
differential df of any real-valued function f. 

Let U be the open subset of R? that is the complement of the single point (0, 0). 
Let P(x, y) = —y/ (x? + y*) and Q (2, y) = a/ (a? + y’). Show that 
tialP/tialy = tialQ /tialz at every point of U, but that w = Pdx + Qdy is not 
the differential df of any smooth function f on U. HINT: If P were f,, then f 
would have to be of the form f (x, y) = — tan! (x/y) + g(y), where g is some 
differentiable function of y. Show that if Q = f, then g(y) is a constant c. Hence, 
f(a, y) must be — tan~! (a/y) +c. But this function f is not continuous, let 
alone differentiable, at the point (1,0). Consider lim f(1, 1/n) and 

lim f(1,—1/n). 


ie) 


The next thing we wish to investigate is the continuity of cas a function of the curve C. 
This brings out a significant difference in the concepts of line integrals versis integrals with 
respect to arc length. For the latter, we typically think of a fixed curve and varying 
functions, whereas with line integrals, we typically think of a fixed differential form and 
variable curves. This is not universally true, but should be kept in mind. 


Let w = Pdx + Qdy be a fixed, bounded, uniformly continuous differential form on a set 
U in R?, and let C be a fixed piecewise smooth curve of finite length L, parameterized by 
y : [a,b] + C, that is contained in U. Then, given an € > O there exists a 6 > O such that, 


w— f. ‘a w| < € whenever the following conditions on 
the curve C hold: 


1. Cisa piecewise smooth curve of finite length L contained in U , parameterized by 
@: [a,b] 3 C. 

2. |p (t)—G (t)| < 6 forall t € [a,b]. 

3. f? |g! (t) — B (t)| dt < 6. 


Let € > 0 be given. Because both P and Q are bounded on U, let Mp and Mg be upper 
bounds for the functions | P| and |Q] respectively. Also, since both P and Q are uniformly 
continuous on U, there exists ad > 0 such that if |(c, d)— (c’, d’)|< 6, then 

|P (c,d) — P(c',d’)|< €/4L and |Q (c,d) — Q(c’, a’)\< €/4L. We may also choose this 
6 to be less than both €/4Mp and e/4Mg. Now, suppose C is a curve of finite length Ds 
ie i @ : [a,b] > C, and that |y (t)—@ (t)| < 6 for all t € (a, 6], and that 


rh |v’ (t ’(t)| < 6. Writing y(t) = (x(t), y(t)) and g(t) = (@ (t), G(t)), we have 
oa 


0 < [Paes Qdy- [ Par + Qay 
b 
= |[ PloW)e'®) - P@W)2 ®) + Qe) © -9(FW)i Oe 
b b 
< [| PwW)'®-P@W)# wld | QW)’ © -2(GW)i lat 
b b 
< f[ P@W)-P@O) |e Wla+ f IPO) le @-2 Ola 
b b 
+f ewe) -@ (6) ly Ola+ [19 (@W) ll - 9 What 
b b 
< vA o'(t)|dt+Mp | |x’ (t) — 2" (t)| de 
b b 
+az f W @ldrme [ Ww -9 lat 
€ b b 
< apf |e @lateme | |e - ola 
b b 
+az f |e OlateMe f |e - ela 
5 + 5 + Mpi + Mos 
as desired. 


Again, we have a special notation when the curve C is a graph. If g: [a,b] > Risa 
piecewise smooth function, then its graph C' is a piecewise smooth curve, and we write 
f uence) Pdzx + Q dy for the line integral of the differential form Pdx + Qdy over the 


curve C' = graph(g). 


As alluded to earlier, there is a connection between contour integrals and line integrals. It is 
that a single contour integral can often be expressed in terms of two line integrals. Here is 
the precise statement. 


Suppose C’ is a piecewise curve of finite length, and that f = u + iv is a complex-valued, 
continuous function on C’.. Let ~ : [a,b] + C be a parameterization of C, and write 

p(t) = a(t) + iy(t). Then 

Equation: 


[sa= [ War—vay+ | (wae + dy), 


We just compute: 
Equation: 


b 
+i / (v (y(t)! (t) + u(v (t))y! (t)) at 


= [ude vdy +i f vdeo + udy 
C c 


as asserted. 


Integration Around Closed Curves, and Green's Theorem 

Two problems are immediately apparent concerning integrating around a closed curve. First, 
where do we start on the curve, which point is the initial point? And second, which way to we 
go around the curve? Recall tha if y : [a, b] + Cis a parameterization of C, then 

w : [a,b] + C, defined by w(t) = y(a+ b — ft), is a parameterization of C' that is the 
reverse of y, i.e., it goes around the curve in the other direction. If we are integrating with 
respect to arc length, this reverse direction won't make a difference, but, for contour integrals 
and line integrals, integrating in the reverse direction will introduce a minus sign. 


Thus far, we have discussed integration over curves joining two distinct points z; and 22. 
Very important in analysis is the concept of integrating around a closed curve, i.e., one that 
starts and ends at the same point. There is nothing really new here; the formulas for all three 
kinds of integrals we have defined will look the same, in the sense that they all are described 
interms of some parameterization y. A parameterization y : [a, b] + C of a closed curve C’ 
is just like the parameterization for a curve joining two points, except that the two points 
(a) and ~(b) are equal. 


Two problems are immediately apparent concerning integrating around a closed curve. First, 
where do we start on the curve, which point is the initial point? And second, which way to we 
go around the curve? Recall tha if y : [a, b] + C is a parameterization of C, then 

w : [a,b] — C, defined by w(t) = y(a+ b — ft), is a parameterization of C' that is the 
reverse of y, i.e., it goes around the curve in the other direction. If we are integrating with 
respect to arc length, this reverse direction won't make a difference, but, for contour integrals 
and line integrals, integrating in the reverse direction will introduce a minus sign. 


The first question mentioned above is not so difficult to handle. It doesn't really matter where 
we start on a closed curve; the parameterization can easily be shifted. 
Exercise: 


Problem: 


Let y [a, b] > R? be a piecewise smooth function that is 1-1 except that y(a) = (0b). 
For each 0 < c < b—a, define G: [a+ c,b+c] : R? by G(t) = v(t) for 
a+c<t<b,and@(t) = y(t—b+aforb<t<b+e. 


1. Show that @ is a piecewise smooth function, and that the range C’ of ~ coincides 
with the range of @. 

2. Let f be an integrable (with respect to arc length) function on C’. Show that 
Equation: 


b b+c 
[ tem olae=f reo) |e ola. 
a a+c 
That is, the integral f, o f (s) ds of f with respect to arc length around the closed 


curve C’ is independent of where we start. 
3. Let f be a continuous complex-valued function on C’. Show that 


Equation: 


b b+e 
[ temeoa-[ seo)e oa. 


oe 


That is, the contour integral f of (¢) d¢ of f around the closed curve C is 
independent of where we start. 

4. Let w = Pdzx + Qdy be a differential form on C’. Prove that 
Equation: 


b+e 


b 
i P(y (t))a" (t) + Q(y(t)y' (t) dt = / P (@(t))& (t) + Q (@(t)) a (t) at. 


a+c 


That is, the line integral f, cw of w around C' is independent of where we start. 


The question of which way we proceed around a closed curve is one that leads to quite 
intricate and difficult mathematics, at least when we consider totaly general smooth curves. 
For our purposes it wil, suffice to study a special kind of closed curve, i.e., curves that are the 
boundaries of piecewise smooth geometric sets. Indeed, the intricate part of the general 
situation has a lot to do with determining which is the “inside” of the closed curve and which 
is the “outside,” a question that is easily settled in the case of a geometric set. Simple pictures 
make this general question seem silly, but precise proofs that there is a definite inside and a 
definite outside are difficult, and eluded mathematicians for centuries, culminating in the 
famous Jordan Curve Theorem, which asserts exactly what our intuition predicts: 

Jordan Curve Theorem 


The complement of a closed curve is the union of two disjoint components, one bounded and 
one unbounded. 


We define the bounded component to be the inside of the curve and the unbounded 
component to be the outside. 


We adopt the following convention for how we integrate around the boundary of a piecewise 
smooth geometric set S. That is, the curve C’g will consist of four parts: the lower boundary 
(graph of the lower bounding function J), the righthand boundary (a portion of the vertical 
line z = b), the upper boundary (the graph of the upper bounding function w), and finally the 
lefthand side (a portion of the vertical line x = a). By integrating around such a curve C's, 
we will always mean proceeding counterclockwise around the curves. Specifically, we move 
from left to right along the lower boundary, from bottom to top along the righthand boundary, 
from right to left across the upper boundary, and from top to bottom along the lefthand 
boundary. Of course, as shown in the exercise above, it doesn't matter where we start. 
Exercise: 


Problem: 


Let S be the closed piecewise smooth geometric set that is determined by the interval 
(a, b] and the two piecewise smooth bounding functions u and J. Assume that the 
boundary C's of S has finite length. Suppose the graph of u intersects the lines x = a 
and x = bat the points (a, c) and (b, d), and suppose that the graph of J intersects those 
lines at the points (a, e) and (6, f). Find a parameterization y : [a’, b’| + Cg of the 
curve C's. 


HINT: Try using the interval [a, b+ d — f + b—a+c-—e|as the domain [a’, b'] of ». 


The next theorem, though simple to state and use, contains in its proof a combinatorial idea 
that is truly central to all that follows in this chapter. In its simplest form, it is just the 
realization that the line integral in one direction along a curve is the negative of the line 
integral in the opposite direction. 


Let S1,..., Sp be a collection of closed geometric sets that constitute a partition of a 
geometric set S, and assume that the boundaries of all the S;'s, as well as the boundary of S, 
have finite length. Suppose w is a continuous differential form on all the boundaries {Cg, }. 
Then 

Equation: 


We give a careful proof for a special case, and then outline the general argument. Suppose 
then that S' is a piecewise smooth geometric set, determined by the interval a, b| and the two 
bounding functions wu and J, and assume that the boundary C’g has finite length. Suppose 
m/(z) is a piecewise smooth function on [a, 6], satisfying i |m’| < oo, and assume that 
I(x) < m(ax) < u(z) for all x € (a,b). Let S; be the geometric set determined by the 
interval {a, b] and the two bounding functions m and J, and let Sz be the geometric set 
determined by the interval [a, b] and the two bounding functions u and m. We note first that 
the two geometric sets S; and Sy comprise a partition of the geometric set S, so that this is 
indeed a pspecial case of the theorem. 


Next, consider the following eight line integrals: First, integrate from left to write along the 
graph of m, second, up the line z = b from (b, m(b)) to (6, u(b)), third, integrate from right 
to left across the graph of u, fourth, integrate down the line x = a from (a, u(a)) to 

(a, m(a)), fifth, continue down the line z = a from (a, m(a)) to (a, l(a)), sixth, integrate 
from left to right across the graph of J, seventh, integrate up the line = b from (0, 1(b)) to 
(b, m(b)), and finally, integfrate from right to left across the graph of m. 


The first four line integrals comprise the line integral around the geometric set Sz, and the 
last four comprise the line integral around the geometric set S$). On the other hand, the first 
and eighth line integrals here cancel out, for one is just the reverse of the other. Hence, the 
sum total of these eight line integrals, integrals 2—7, is just the line integral around the 


boundary C's of S. Therefore 
Equation: 
/ w= / G2 = / Ww 
Cs Cs, Csy 


as desired. 


We give next an outline of the proof for a general partition Sj, ...,.S,, of S. Let S; be 
determined by the interval [a,, b,] and the two bounding functions u, and J,. Observe that, if 
the boundary C's, of S; intersects the boundary C's, of S;; in a curve C, then the line integral 
of w along C’,, when it is computed as part of integrating counterclockwise around Sj, is the 
negative of the line integral along C’, when it is computed as part of the line integral 
counterclockwise around S;. Indeed, the first line integral is the reverse of the second one. (A 
picture could be helpful.) Consequently, when we compute the sum of the line integrals of w 
around the C'g,'s, All terms cancel out except those line integrals that ar computed along parts 
of the boundaries of the S;'s that intersect no other S;. But such parts of the boundaries of 
the S$j,'s must coincide with parts of the boundary of S. Therefore, the sum of the line 
integrals of w around the boundaries of the S;'s equals the line integral of w around the 
boundary of S, and this is precisely what the theorem asserts. 

Exercise: 


Problem: 


Prove the analog of [link] for contour integrals: Let $1, ...,.S,, be a collection of closed 
geometric sets that constitute a partition of a geometric set S, and assume that the 
boundaries of all the S;,'s, as well as the boundary of S, have finite length. Suppose f is 
a continuous complex-valued function on all the boundaries {C'g, } as well as on the 
boundary C's. Then 

Equation: 


j fod=Sif Fae. 


k-1 4 Cs, 


We come now to the most remarkable theorem in the subject of integration over curves, 
Green's Theorem. Another fanfare, please! 
Green 


Let S be a piecewise smooth, closed, geometric set, let C's denote the closed curve that is the 
boundary of S,, and assume that Cg is of finite length. Suppose w = Pdz + Qdyisa 
continuous differential form on S$ that is smooth on the interior S° of S. Then 


Equation: 
tial tialP 
i, w= Pdr + Qay= [ aa = 2 
Cs Cs s tialx tialy 


REMARK The first thing to notice about this theorem is that it connects an integral around a 
(1-dimensional) curve with an integral over a (2-dimensional) set, suggesting a kind of 
connection between a 1-dimensional process and a 2-dimensional one. Such a connection 
seems to be unexpected, and it should therefore have some important implications, as indeed 
Green's Theorem does. 


The second thing to think about is the case when w is an exact differential df of a smooth 
function f of two real variables. In that case, Green's Theorem says 


Equation: 
tial f tialf 
d dy = ce J ry)s 
i. tialx = tialy e [iw Fey) 


which would be equal to 0 if f € C? (8), by [link]. Hence, the integral of df around any 
such curve would be 0. If U is an open subset of R?, there may or may not be some other w's, 
called closed differential forms, having the property that their integral around every piecewise 
smooth curve of finite length in U is 0, and the study of these closed differential forms w that 
are not exact differential forms df has led to much interesting mathematics. It turns out that 
the structure of the open set U, e.g., how many “holes” there are in it, is what's important. 
Take a course in Algebraic Topology! 


The proof of Green's Theorem is tough, and we break it into several steps. 
Suppose S is the rectangle |a, b] x |c, d]. Then Green's Theorem is true. 


We think of the closed curve C's bounding the rectangle as the union of four straight lines, 
C, C2, C3 and C4, and we parameterize them as follows: Let vy : [a, b] + C be defined by 
y(t) = (t,c); let y: [b,b +d —c] — C4 be defined by y(t) = (b,t — b +c); let 
y:(b+d—c,b+d—c+b—al| > Cs be defined by y(t) = (b+ d—c+b-—t,d); and 
let py: [b+d—c+b—a,b+d—c+b—a+d-—c| — C4 be defined by 

y(t) = (a,b +d—c+b—a+d-—t). One can check directly to see that this y 
parameterizes the boundary of the rectangle S = |a, b] x [c,d]. 


As usual, we write y(t) = (x(t), y(t)). Now, we just compute, use the Fundamental 
Theorem of Calculus in the middle, and use part (d) of [link] at the end. 
Equation: 


Lae 
ery 


= [ Par edys f Pdzx+Qdy 
C1 
+f ek. Pdzr+Qdy 
Cs C4 


z [Peet 


Q (y (t))y" (t) at 


P(b+d—c+b-—t,d)(—1)dt 


i, 
b+d—c+b—a+d—c 
[ 


Q(a,b+d—c+b—a+d-—t)(-1)dt 
+d—c+b—a 


[ros Poona 
- [ Pedad—f aanae 
[ewo- Q(a, par— (Pea) — Pte) at 
zs i. i oe s,t) dade 
“f [tae a 


hs / tialQ  tialP 
od g\ tialx tialy ’ 


proving the lemma. 


Suppose S is a right triangle whose vertices are of the form (a, c), (b, c) and (b, d). Then 
Green's Theorem is true. 


We parameterize the boundary C's of this triangle as follows: For t € |a, bj, set y(t) = (t, c); 
fort € [b,b +d —cl, set y(t) = (b,t +c — b); and fort € [b+ d—c,b+d—c+b-—al, 
set p(t) = (b+ d—c+b—t,b+d—c+d —t). Again, one can check that this y 
parameterizes the boundary of the triangle S. 


Write y(t) = (a(t), y(t)). Again, using the Fundamental Theorem and [link], we have 
Equation: 
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which proves [link]. 


Suppose 5S}, ..., S, is a partition of the geometric set S, and that the boundary C's, has finite 
length for all 1 < k < n. If Green's Theorem holds for each geometric set S;, then it holds 
for S. 


From [link] we have 
Equation: 


n 
fe=-v] 5 
Cs k=1 4 Cs, 


and from [link] we have 
Equation: 


[a-r=d f e- Py 


Since Green's Theorem holds for each k, we have that 
Equation: 


and therefore 
Equation: 


as desired. 
Exercise: 


Problem: 


a. Prove Green's Theorem for a right triangle with vertices of the form (a,c), (b,c), 
and (a, d). 

b. Prove Green's Theorem for a trapezoid having vertices of the form (a,c), (b,c), 
(b, d), and (a, e), where both d and e are greater than c. HINT: Represent this 
trapezoid as the union of a rectangle and a right triangle that share a border. Then 
use [link]. 

c. Prove Green's Theorem for S' any quadrilateral that has two vertical sides. 

d. Prove Green's Theorem for any geometric set S whose upper and lower bounding 
functions are piecewise linear functions. HINT: Show that S can be thought of as a 
finite union of quadrilaterals, like those in part (c), each one sharing a vertical 
boundary with the next. Then, using induction and the previous exercise finish the 
argument. 


We need one final lemma before we can complete the general proof of Green's Theorem. This 
one is where the analysis shows up; there are carefully chosen e's and 6's. 


Suppose S is contained in an open set U and that w is smooth on all of U. Then Green's 
Theorem is true. 


Let the piecewise smooth geometric set S be determined by the interval |a, b] and the two 
bounding functions wu and J. Using [link], choose an r > 0 such that the neighborhood 
N,(S) C U. Now let € > 0 be given, and choose delta to satisfy the following conditions: 


a. 0 < r/2, from which it follows that the open neighborhood N; (S) is a subset of the 
compact set N,/2 (S). (See part (f) of [link].) 

b. 6 < €/4M, where M is a common bound for all four continuous functions 
|P|, |Q|, |Py|, and |Q.| on the compact set NV,/2 (S). 

c.6 < €/4M(b—a). 

d. 6 satisfies the conditions of [link]. 


Next, using [link], choose two piecewise linear functions p,, and p; so that 


1. |u(x)—p, (x)| < 6/2 for all x € [a, 5]. 
2. |l(x)—p; (x)| < 6/2 for all x € |a, b]. 
3. f° jul (x) — pl, (w)| da < 6. 

4. f° \l' (x) — pi (2)| de <6. 


Let S$ be the geometric set determined by the interval [a, b] and the two bounding functions 7 


and 1, where @ = p, + 6 /2 and i=p, —6 /2. We know that both &@ and i are piecewise 
linear functions. We have to be a bit careful here, since for some z's it could be that 
Pu (x) < pi (a). Hence, we could not simply use p,, and p; themselves as bounding functions 


for S. We do know from (1) and (2) that u (a) < @(zx) and1(x) > (a), which implies that 
the geometric set S is contained in the geometric set S. Also Sis a Subset of the 
neighborhood NN; (s), which in turn is a subset of the compact set N;./2 (S). 


Now, by part (d) of the preceding exercise, we know that Green's Theorem holds for S. That 


[ o= [@e-P) 


Equation: 

We will show that Green's Theorem holds for S by showing two things: (i) 

os — fo. w| < 4e, and (ii) fe (Q:) — Py) — fg (Qe - P,)| < e€. We would then have, 
Ss 

by the usual adding and subtracting business, that 


Equation: 


[o- [@e- Pal< se 


and, since e€ is an arbitrary positive number, we would obtain 


Equation: 
[ =f @-P. 
Cs S 


Let us estabish (i) first. We have from (1) above that |u (x)—w(a)| < 6 for all x € |a, bj, and 
from (3) that 
Equation: 


Hence, by [link], 
Equation: 


; Ww — / w| <e. 
graph(u) graph (7) 


Similarly, using (2) and (4) above, we have that 
Equation: 


Also, the difference of the line integrals of w along the righthand boundaries of S and Sis 
less than e. Thus 
Equation: 


i ‘awneat- f° Q (b, t) dt| 


@i(b) (b) 

< [ Q(b,t)at|+| [  Q(b,t) ae 
u(b) i(b) 

< M 7 (b) —2(6)| + |u(®) - @()) 

< M(6+5) 

— 2M 
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Of course, a similar calculation shows that 
Equation: 
(a,l(a)) (a,l(a)) 
/ w — / wl <e. 
C'(a,u(a)) C (a,ui(a)) 
These four line integral inequalities combine to give us that 


Equation: 
/ w- | w| < de, 
Cs 5 


establishing (i). 


Finally, to see (ii), we just compute 
Equation: 


0 < [Q-Pa- f@s-Pa 


b a(t) ee 
_ / I. (2 (Gs) —Py (he) aad / [ |, (Qs (b8) ~ Pat) dat 


b p(t) b pu(t) 
- / - (Qz (t, 8) — P, (t,8)) dsdt + i / (Qe (t,8) — Py (t,2)) dade| 
a Ji(t) a Ju(t) 


< 2u( [ L(t) — I (t)| + |@(t) — u(t)| dt 
< 4M6(b—a) 
< 6. 


This establishes (ii), and the proof is complete. 


At last, we can finish the proof of this remarkable result. 
PROOF OF GREEN'S THEOREM 


As usual, let S be determined by the interval [a, b] and the two bounding functions u and J. 
Recall that u(x) — I(x) > 0 for all x € (a,b). For each natural number n > 2, let S,, be the 
geometric set that is determined by the interval [a + 1/n, b — 1/n] and the two bounding 
functions uy, and 1,, where un = u — (u — 1) /n restricted to the interval 
[a+1/n,b—1/n], andl, =1+ (u —1)/n restricted to [a + 1/n, b — 1/n]. Then each S;, 
is a piecewise smooth geometric set, whose boundary has finite length, and each S;, is 
contained in the open set S'° where by hypothesis w is smooth. Hence, by Lemma 4, Green's 
Theorem holds for each S,,. Now it should follow directly, by taking limits, that Green's 
Theorem holds for S. In fact, this is the case, and we leave the details to the exercise that 
follows. 

Exercise: 


Problem: Let S,w, and the S,,'s be as in the preceding proof. 


a. Using [link], show that 
Equation: 


/ w =lim Ww. 
Cs C'sp, 


b. Let f be a bounded integrable function on the geometric set S. Prove that 


Equation: 
/ f =lim / 2 
S Sn 


c. Complete the proof to Green's Theorem; i.e., take limits. 


REMARK Green's Theorem is primarily a theoretical result. It is rarely used to “compute” a 
line integral around a curve or an integral of a function over a geometric set. However, there 
is one amusing exception to this, and that is when the differential form w = a dy. For that 
kind of w, Green's Theorem says that the area of the geometric set S can be computed as 


follows: 
A(s)= f= Bs pga 
3 s tialx Cs 


Equation: 
This is certainly a different way of computing areas of sets from the methods we developed 
earlier. Try this way out on circles, ellipses, and the like. 


The Fundamental Theorem of Algebra, and the Fundamental Theorem of 
Analysis 

An introduction to the fundamental theorem of algebra, with links to 
important theorems on the topic. 


In this chapter we will discover the incredible difference between the 
analysis of functions of a single complex variable as opposed to functions 
of a single real variable. Up to this point, in some sense, we have treated 
them as being quite similar subjects, whereas in fact they are extremely 
different in character. Indeed, if is a differentiable function of a complex 
variable on an open set then we will see that is actually 
expandable in a Taylor series around every point in _In particular, a 
function of a complex variable is guaranteed to have infinitely many 
derivatives on if itmerely has the firstoneon This is in marked 
contrast with functions of a real variable. See part (3) of [Link]. 


The main points of this chapter are: 


. The Cauchy-Riemann Equations ([link]), 

. Cauchy's Theorem ((link]), 

. Cauchy Integral Formula ((link]), 

A complex-valued function that is differentiable on an open set is 
expandable in a Taylor series around each point of the set ({link]), 
. The Identity Theorem ([link]), 

. The Fundamental Theorem of Algebra ((link]), 

. Liouville's Theorem (([link]), 

The Maximum Modulus Principle (corollary to [link]), 

. The Open Mapping Theorem ((link)), 

. The uniform limit of analytic functions is analytic ({link]), and 

. The Residue Theorem ((link]). 
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Cauchy's Theorem 
A module containing theorems and exercises leading up to Cauchy's 
theorem. 


We begin with a simple observation connecting differentiability of a 
function of a complex variable to a relation among of partial derivatives of 
the real and imaginary parts of the function. Actually, we have already 
visited this point in [link]. 

Cauchy-Riemann equations 


Let f = u+ iv be a complex-valued function of a complex variable 
z=a2+iy = (z,y), and suppose f is differentiable, as a function of a 
complex variable, at the point c = (a, b). Then the following two partial 
differential equations, known as the Cauchy-Riemann Equations, hold: 
Equation: 


tialu tialv 
— (a,b) = ——(a,b 
tial a?) tialy (2,5), 
and 
Equation: 
tialu tialv 
tialy a0 tial ae): 


We know that 
Equation: 


f(e +h) — fle) 


f' (c) =lim h > 0 h ; 


and this limit is taken as the complex number h approaches 0. We simply 
examine this limit for real h's approaching 0 and then for purely imaginary 
h's approaching 0. For real h's, we have 

Equation: 


f'(c) = f'(a+ib) 
_ tm Seth +t) — f(a+ ib) 


h-0 h 
= limhs i 
— Yim WG +h.d)—ula,d) | a wat h,b) — (a,b) 
h-0 h h—>0 h 
Lal tial 
= =, b) +i——~ (a,b). 
tialz tialx 


For purely imaginary h's, which we write as h = 1k, we have 
Equation: 

f'(c) = fi (at) 
f(a+i(b+k)) — f(a + ib) 


= lim 
k-0 tk 
_ u(a,b+k)+iv(a,b+k) — u(a, b) — iv(a, b) 
Sim St 
k-0 ik 
u(a,b+k)—u(a,b)  v(a,b+k) — v(a,b) 
= 6 im 2ST. $= 
k-+0 k k 
_ tialu tialv 
ti ly a9) tial G50) 


Equating the real and imaginary parts of these two equivalent expressions 
for f’ (c) gives the Cauchy-Riemann equations. 


As an immediate corollary of this theorem, together with Green's Theorem 
({link]), we get the following result, which is a special case of what is 
known as Cauchy's Theorem. 


Let S be a piecewise smooth geometric set whose boundary C'g has finite 
length. Suppose f is a complex-valued function that is continuous on S and 


differentiable at each point of the interior S° of S. Then the contour 
integral Jo. 7 (Chde = 0, 


Exercise: 


Problem: 


a. Prove the preceding corollary. See [link]. 

b. Suppose f = u + 2v is a differentiable, complex-valued function 
on an open disk B,. (c) in C, and assume that the real part u is a 
constant function. Prove that f is a constant function. Derive the 
same result assuming that v is a constant function. 

c. Suppose f and g are two differentiable, complex-valued functions 
on an open disk B,. (c) in C’. Show that, if the real part of f is 
equal to the real part of g, then there exists a constant & such that 
f(z) = g(z) +k, for all z € B, (c). 


For future computational purposes, we give the following implications of 
the Cauchy-Riemann equations. As with [link], this next theorem mixes the 
notions of differentiability of a function of a complex variable and the 
partial derivatives of its real and imaginary parts. 


Let f = u+ iv be a complex-valued function of a complex variable, and 
suppose that f is differentiable at the point c = (a,b). Let A be the 2 x 2 
matrix 

Equation: 


A = (ux (a, b)vx (a, b)uy (a, b)vy (a,b). 


Then: 


1. | f'(c)|? =det (A). 
2. The two vectors 
Equation: 


+> + 
Vi = (uz (a,b), uy (a, b)) and V2 = (vz (a, b), vy (a, b)) 


are linearly independent vectors in R? if and only if f’ (c) 4 0. 
3. The vectors 
Equation: 


= > 
Ve = (ue (a, 6), 2 (a,b) and Vy = (2ty (a,b), vy (a,b) 
are linearly independent vectors in R? if and only if f’ (c) 4 0. 


Using the Cauchy-Riemann equations, we see that the determinant of the 
matrix A is given by 
Equation: 

det A = uz (a, b)vy (a, b) — uy (a, b)vz (a, b) 


(uz (a,b))” + (ve (a, b))” 
(ux (a,b) + ive (a, b)) (uz (a, b) — ivz (a, b)) 


| 


proving part (1). 


The vectors V; and Vy are the columns of the matrix A, and so, from 
elementary linear algebra, we see that they are linearly independent if and 
only if the determinant of A is nonzero. Hence, part (2) follows from part 
(1). Similarly, part (3) is a consequence of part (1). 


It may come as no surprise that the contour integral of a function f around 
the boundary of a geometric set S is not necessarily 0 if the function f is 
not differentiable at each point in the interior of S. However, it is exactly 
these kinds of contour integrals that will occupy our attention in the rest of 
this chapter, and we shouldn't jump to any conclusions. 

Exercise: 


Problem: 


Let c be a point in C’,, and let S be the geometric set that is a closed 
disk B, (c). Let y be the parameterization of the boundary C,, of S$ 
given by y(t) =c + re” fort € [0, 27]. For each integer n € Z, 
define fp (z) = (z— cc)”. 


a. Show that Jo fn(¢d¢ = 0 for alln # —1. 


b. Show that 
Equation: 


[ ta@ac= fae = 2m 


There is a remarkable result about contour integrals of certain functions that 
aren't differentiable everywhere within a geometric set, and it is what has 
been called the Fundamental Theorem of Analysis, or Cauchy's Theorem. 
This theorem has many general statements, but we present one here that is 
quite broad and certainly adequate for our purposes. 

Cauchy's Theorem, Fundamental Theorem of Analysis 


Let S be a piecewise smooth geometric set whose boundary C’g has finite 
length, and let S C S$ ° be a piecewise smooth geometric set, whose 


— 


boundary C’g also is of finite length. Suppose f is continuous on SS Y 
i.e., at every point z that is in S but not in S°, and assume that f is 


differentiable on S°7 $ , ie., at every point z in S° but not in gS. (We think 
of these sets as being the points “between” the boundary curves of these 
geometric sets.) Then the two contour integrals [ Cs f (¢) d¢ and 


J Cs f (©) d¢ are equal. 


Let the geometric set S be determined by the interval [a, b] and the two 


bounding functions u and J, and let the geometric set S be determined by 


the subinterval a, i of [a, b] and the two bounding functions w@ and i. 


Because S C §$°, we know that @(t) < u(t) and l(t) < (t) for all 


E a, i . We define four geometric sets $1, ..., 54 as follows: 


1. S$ is determined by the interval la, a| and the two bounding 
functions u and / restricted to that interval. 


2. S» is determined by the interval a, i and the two bounding 
functions u and ¥ restricted to that interval. 
3. Ss is determined by the interval a, i and the two bounding 


functions J and J restricted to that interval. 
4. S4 is determined by the interval 6, b and the two bounding functions 


uw and I restricted to that interval. 


Observe that the five sets $' 91, +--, 94 constitute a partition of the 
geometric set S. The corollary to [link] applies to each of the four 
geometric sets $1, ...,.54. Hence, the contour integral of f around each of 
the four boundaries of these geometric sets is 0. So, by [link], 

Equation: 


dC = ¢)d 
(a [£0 yo 7 
Cs 
as desired. 
Exercise: 
Problem: 


a. Draw a picture of the five geometric sets in the proof above and 
justify the claim that the sum of the four contour integrals around 


the geometric sets $1, ..., S4 is the integral around C’g minus the 
integral around C's. 


. Let $1, ..., Sn be pairwise disjoint, piecewise smooth geometric 
sets, each having a boundary of finite length, and each contained 
in a piecewise smooth geometric set S whose boundary also has 
finite length. Prove that the S;,'s are some of the elements of a 


partition {8 i} of S, each of which is piecewise smooth and has a 


boundary of finite length. Show that, by reindexing, Sj, ..., Sn 
can be chosen to be the first n elements of the partition {5 a 


HINT: Just carefully adjust the proof of [link]. 

. Suppose S is a piecewise smooth geometric set whose boundary 
has finite length, and let S},...,.S,, be a partition of S for which 
each S; is piecewise smooth and has a boundary C's, of finite 
length. Suppose f is continuous on each of the boundaries C's, of 
the S;'s as well as the boundary C's of S,, and assume that f is 
continuous on each of the S;'s, for 1 < k < m, and differentiable 
at each point of their interiors. Prove that 

Equation: 


Poac= > f seac. 


k=m-+1 Cs, 


. Prove the following generalization of the Cauchy Theorem: Let 
1, ..-, Sn be pairwise disjoint, piecewise smooth geometric sets 
whose boundaries have finite length, all contained in the interior 
of a piecewise smooth geometric set S whose boundary also has 
finite length. Suppose f is continuous at each point of S that is 
not in the interior of any of the $;'s, and that f is differentiable at 
each point of S° that is not an element of any of the S;'s. Prove 
that 

Equation: 


n 


Perhaps the main application of [link] is what's called the Cauchy Integral 
Formula. It may not appear to be useful at first glance, but we will be able 
to use it over and over throughout this chapter. In addition to its theoretical 
uses, it is the basis for a technique for actually evaluating contour integrals, 
line integrals, as well as ordinary integrals. 

Cauchy Integral Formula 


Let S be a piecewise smooth geometric set whose boundary C'g has finite 
length, and let f be a continuous function on S that is differentiable on the 
interior S° of S. Then, for any point z € S$ 0 we have 

Equation: 


REMARK This theorem is an initial glimpse at how differentiable 
functions of a complex variable are remarkably different from differentiable 
functions of a real variable. Indeed, Cauchy's Integral Formula shows that 
the values of a differentiable function f at all points in the interior of a 
geometric set S are completely determined by the values of that function on 
the boundary of the set. The analogous thing for a function of a real variable 
would be to say that all the values of a differentiable function f at points in 
the open interval (a, b) are completely determined by its values at the 
endpoints a and b. This is patently absurd for functions of a real variable, so 
there surely is something marvelous going on for differentiable functions of 
a complex variable. 


Let r be any positive number such that B , (z) is contained in the interior 
S° of S, and note that the close disk B,. (z) is a piecewise smooth 
geometric set S contained in S°. We will write C,, instead of C g for the 


boundary of this disk, and we will use as a parameterization of the curve C, 
the function y : [0,27] > C, given by y (t) = z+ re”. Now the function 


— 


9(C) = f(0)/(¢ — z) is continuous on $9 §° and differentiable on 


S°n S , so that [link] applies to the function g. Hence 
Equation: 


Lr 1 
ra ees — [aac 


1 
7 sal 9 (0), dc 


1 
1 f 10 x 
277i Jo. GC —2 
1 2 f(z+re% . 
= Se RiCauiad ; ) ire” dt 
2ri Jo ztre*—z 
1 2a ‘ 
= z+re) dt. 
7 ) 


Since the equality established above is valid, independent of r, we may take 
the limit as r goes to 0, and the equality will persist. We can evaluate such a 
limit by replacing the r by 1/n, in which case we would be evaluating 
Equation: 


1 2a 1 , 1 2a 
lim — f{(z+—e” )dt=lim — fr (t) dt, 
0 n 


n—-oo aT 0 


where f, (t) = f (z+ fraclne”). Finally, because the function f is 
continuous at the point z, it follows that the sequence { f,, } converges 
uniformly to the constant function f(z) on the interval [0, 27]. So, by 
Theorem 5.6, we have that 

Equation: 


Therefore, 


Equation: 
1 F(G ae, tO it 
— —] mi— t = 
a ea lim = f (z+re™) dt = f (2), 


and the theorem is proved. 


The next exercise gives two simple but strong consequences of the Cauchy 
Integral Formula, and it would be wise to spend a few minutes deriving 
other similar results. 

Exercise: 


Problem: 


a. Let S and f be as in the preceding theorem, and assume that 
f(z) = 0 for every point on the boundary C'g of S. Prove that 
f(z) = 0 for every z € S. 

b. Let S be as in part (a), and suppose that f and g are two 
continuous functions on S,, both differentiable on S°, and such 
that f(¢) = g(¢) for every point on the boundary of S. Prove that 
f(z) = g(z) forall z € S. 


The preceding exercise shows that two differentiable functions of a 
complex variable are equal everywhere on a piecewise smooth geometric 
set S if they agree on the boundary of the set. More is true. We will see 
below in the Identity Theorem that they are equal everywhere on a 
piecewise smooth geometric set S if they agree just along a single 
convergent sequence in the interior of S. 


Combining part (b) of [link], [link], and [link], we obtain the following 
corollary: 


Let Sj, ..., S, be pairwise disjoint, piecewise smooth geometric sets whose 
boundaries have finite length, all contained in the interior of a piecewise 


smooth geometric set S whose boundary has finite length. Suppose f is 
continuous at each point of S that is not in the interior of any of the S;'s, 
and that f is differentiable at each point of 9° that is not an element of any 
of the S;,'s. Then, for any z € 9° that is not an element of any of the S;'s, 
we have 

Equation: 


1 
f= (fz IO a 3 ee 10a), 


Let r > 0 be such that B, (z) is disjoint from all the S;,'s. By part (b) of 
[link], let 7), ..., Zi, be a partition of S such that 7; = S; forl<k <n, 
and T;,41 = B, (z). By [link], we know that 


Equation: 
[ene ® "2h ts 


From the Cauchy Integral Formula, we know that 


Equation: 
[. c= (Sig = anif (2). 


Also, since f(¢)/(¢ — z) is differentiable at each point of the interior of the 
sets JT, fork > n+ 1, we have from [link] that for allk >n+1 
Equation: 


Therefore, 


Equation: 


la a=y fz LO ae 4 oni nif (2), 


which completes the proof. 
Exercise: 


Problem: 
Suppose S is a piecewise smooth geometric set whose boundary has 
finite length, and let ci, ..., Cn be points in S°. Suppose f is a 


complex-valued function that is continuous at every point of S' except 
the C;'s and differentiable at every point of S° except the c;'s. Let 


11, ---) Tn be positive numbers such that the disks {B R, (cx) } are 


pairwise disjoint and all contained in S°. 


a. Prove that 
Equation: 


fOG=S | Ou 


where C;, denotes the boundary of the disk B,., (cx). 
b. For any z € S° that is not in any of the closed disks B,, (cx), 


show that 
Equation: 
. ‘ f(¢) 
19) 4 = dé. 
Le C= anit (a) +0 f =o, ¢ 


c. (c) Specialize part (b) to the case where S' = B, (c), and f is 
analytic at each point of B, (c) except at the central point c. For 
each z ~ cin B, (c), and any 0 < 6 < |z — cl, derive the 
formula 


Equation: 


Hit MOig.. SF SO 


«Oni de C-z > @ile,C-z 


dc. 


Basic Applications of the Cauchy Integral Formula 

As a major application of the Cauchy Integral Formula, let us show the 
much alluded to remarkable fact that a function that is a differentiable 
function of a complex variable on an open set U is actually expandable in a 
Taylor series around every point in U, i.e., is an analytic function on U. 


As a major application of the Cauchy Integral Formula, let us show the 
much alluded to remarkable fact that a function that is a differentiable 
function of a complex variable on an open set U is actually expandable in a 
Taylor series around every point in U, i.e., is an analytic function on U. 


Suppose f is a differentiable function of a complex variable on an open set 
U CC , and let c be an element of U. Then f is expandable in a Taylor 


series around c. In fact, for any r > 0 for which B, (c) C U, we have 
Equation: 


for all z € B, (c). 


Choose an r > 0 such that the closed disk B, (c) C U, and write C, for 
the boundary of this disk. Note that, for all points ¢ on the curve C’,, and 
any fixed point z in the open disk B,. (c), we have that 


jz —c| <r = |¢ —cl, whence |z — c|/|¢ — c| = |z — e|/r < 1. Therefore 
the geometric series 
Equation: 


CO n 
S° os converges to ee 
¢ —e 1 z—c * 


n=0 C-c 


Moreover, by the Weierstrass /-Test, as functions of the variable ¢, this 
infinite series converges uniformly on the curve C’,. We will use this in the 
calculation below. Now, according to [link], we have that 

Equation: 


i) = 1 ff f© de 


2771 c,G-2 


et f(Q) 
7 ae ee Ta 


tpt FQ &fz-e\" 
= oe chs) . 


_ i a O) _ ayn 
os ae 


_ 227 70 
— mf, (z—c)"d¢ 


= (¢ _ ov 
= Sra,(2-0)", 
n=0 


where we are able to bring the summation sign outside the integral by part 
(3) of [link], and where 
Equation: 


_ iil f(Q) 
an = Oni g (¢-o™ dc. 


This proves that f is expandable in a Taylor series around the point c, as 
desired. 


Using what we know about the relationship between the coefficients of a 
Taylor series and the derivatives of the function, together with the Cauchy 
Integral Theorem, we obtain the following formulas for the derivatives of a 
differentiable function f of a complex variable. These are sometimes also 
called the Cauchy Integral Formulas. 


Suppose f is a differentiable function of a complex variable on an open set 
U, and let c be an element of U. Then f is infinitely differentiable at c, and 
Equation: 


f(a f MO «, 


_ Pra) C, (¢ _ oc)" 


for any piecewise smooth geometric set S C U whose boundary C's has 
finite length, and for which c belongs to the interior S° of S. 
Exercise: 


Problem: 


a. Prove the preceding corollary. 

b. Let f, U, and c be as in [link]. Show that the radius of 
convergence r of the Taylor series expansion of f around c is at 
least as large as the supremum of all s for which B, (c) C U. 

c. Conclude that the radius of convergence of the Taylor series 
expansion of a differentiable function of a complex variable is as 
large as possible. That is, if f is differentiable on a disk B, (c), 
then the Taylor series expansion of f around c converges on all of 
B, (c). 

d. Consider the real-valued function of a real variable given by 
f (x) =1/ (1+ 2”). Show that f is differentiable at each real 
number «. Show that f is expandable in a Taylor series around 0, 
but show that the radius of convergence of this Taylor series is 
equal to 1. Does this contradict part (c)? 

e. Let f be the complex-valued function of a complex variable given 
by f (z) = 1/ (1+ 27). We have just replaced the real variable x 
of part (d) by a complex variable z. Explain the apparent 
contradiction that parts (c) and (d) present in connection with this 
function. 


Exercise: 


Problem: 


a. Let S' be a piecewise smooth geometric set whose boundary C's 
has finite length, and let f be a continuous function on the curve 
C5. Define a function F on S° by 


Equation: 
F@ =f fo de. 


Prove that F’ is expandable in a Taylor series around each point 
c € $°. Show in fact that F(z) = S*an(z—c)” forall zina 
disk B, (c) C 8°, where 

Equation: 


n! f(g) 


er ae cs (0) d¢ 
HINT: Mimic the proof of [link]. 

b. Let f and F’ be as in part (a). Is F’ defined on the boundary C's of 
S? If z belongs to the boundary C's, and z =lim z,,, where each 
Zn € S°, Does the sequence { F (z,,)} converge, and, if so, does 
it converge to f(z)? 

c. Let S be the closed unit disk B; (0), and let f be defined on the 
boundary C’; of this disk by f (z) = z,iie., f(a +iy) = x — iy. 
Work out the function F’ of part (a), and then re-think about part 
(b). 

d. Let f and F be as in part (a). If, in addition, f is continuous on all 
of S and differentiable on S°, show that F(z) = 27i f(z) for all 
z € §°. Think about this “magic” constant 277i. Review the proof 
of the Cauchy Integral Formula to understand where this constant 
comes from. 


[link] and [link] constitute what we called the “identity theorem” for 
functions that are expandable in a Taylor series around a point c. An even 
stronger result than that is actually true for functions of a complex variable. 
Identity Theorem 


Let f be a continuous complex-valued function on a piecewise smooth 
geometric set S, and assume that f is differentiable on the interior S° of S. 
Suppose {z;} is a sequence of distinct points in S ° that converges to a 
point c in S°. If f(z.) = 0 for every K, then f(z) = 0 for every z € S. 


It follows from [link] that there exists an r > 0 such that f(z) = 0 for all 
z € B, (c). Now let w be another point in S°, and let us show that f(w) 


must equal 0. Using part (f) of [link], let yp : a, i —> C’be a piecewise 
smooth curve, joining c to w, that lies entirely in S 0 Let A be the set of all 
te fa, i such that f(y(s)) = 0 for all s € [@,t). We claim first that A is 


nonempty. Indeed, because y is continuous, there exists an e > 0 such that 
ly (s) — cl = |p (s) — ¢ (@)|< rif |s—a|< e. Therefore f(y(s)) = 0 for 
alls € la, a+ €), whence, @ + € € A. Obviously, A is bounded above by 


b, and we write to for the supremum of A. We wish to show that tp = b, 
whence, since y is continuous at B,f (w) = f (¢ (6) ) =F (ory) = 0. 


Suppose, by way of contradiction, that tg < b, and write z9 = y (to). Now 
zo € S°, and zp =lim y (ty — 1/k) because y is continuous at to. But 
f(y (to — 1/k)) = 0 for all k. So, again using [link], we know that there 
exists an r’ > 0 such that f(z) = 0 for all z € B,. (zo). As before, because 
y is continuous at tg, there exists a 6 > 0 such that tj + 6 < b and 

ly (s) — vy (to) |< 7’ if |s—to|< 6. Hence, f(y(s)) = 0 for all 

s € (to — 6, tp + 6), which implies that to + 6 belongs to A. But then to 
could not be the supremum of A, and therefore we have arrived at a 


contradiction. Consequently, tg = 6, and therefore f(w) = 0 for all 

w € §S°. Of course, since every point in S is a limit of points from S°, and 
since f is continuous on S, we see that f(z) = 0 for all z € S, and the 
theorem is proved. 


The next exercise gives some consequences of the Identity Theorem. Part 
(b) may appear to be a contrived example, but it will be useful later on. 
Exercise: 


Problem: 


a. Suppose f and g are two functions, both continuous on a 
piecewise smooth geometric set S and both differentiable on its 
interior. Suppose {z,} is a sequence of elements of S ° that 
converges to a point c € §°, and assume that f (z,) = g (zx) for 
all k. Prove that f(z) = g(z) for all z € S. 

b. Suppose f is a nonconstant differentiable function defined on the 
interior of a piecewise smooth geometric set 9. If c € S° and 
B. (c) C S°, show that there must exist an 0 < r < € for which 
f(c) 4 f(z) for all z on the boundary of the disk B, (c). 


The Fundamental Theorem of Algebra 
We can now prove the Fundamental Theorem of Algebra, the last of our 
primary goals. One final trumpet fanfare, please! 


We can now prove the Fundamental Theorem of Algebra, the last of our 
primary goals. One final trumpet fanfare, please! 
Fundamental Theorem of Algebra 


Let p(z) be a nonconstant polynomial of a complex variable. Then there 
exists a complex number Zo such that p(zo) = 0. That is, every nonconstant 
polynomial of a complex variable has a root in the complex numbers. 


We prove this theorem by contradiction. Thus, suppose that p is a 
nonconstant polynomial of degree n > 1, and that p(z) is never 0. Set 
f(z) = 1/p(z), and observe that f is defined and differentiable at every 
point z € C’. We will show that f is a constant function, implying that 

p = 1/f is aconstant, and that will give the contradiction. We prove that f 
is constant by showing that its derivative is identically 0, and we compute 
its derivative by using the Cauchy Integral Formula for the derivative. 


From part (4) of [link], we recall that there exists a B > 0 such that 


Jal )z|"" < |p (z)], for all z for which |z| > B, and where c,, is the 
7(i2h= a for 


all |z| > B, where we write M for 2/|c,,|. Now, fix a point c € C’. Because 
f is differentiable on the open set U = C’, we can use the corollary to [link] 
to compute the derivative of f at c by using any of the curves C’, that bound 
the disks B, (c), and we choose an r large enough so that |c + re”|> B for 
allO < ¢ < 27. Then, 

Equation: 


(nonzero) leading coefficient of the polynomial p. Hence, 


1 f _£© 
ami I, ¢-oF 


1/2" f(ce+re” ; 
= — / BOE) ge dt 
2m \Jo (c+ ret —c) 
a iil : 
< c+ re” )| dt 
< filet re) 
1 f" M 
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Hence, by letting r tend to infinity, we get that 
Equation: 


M 
'(c)| <li —0, 
f'(e)| Slim — 


and the proof is complete. 
REMARK 


The Fundamental Theorem of Algebra settles a question first raised back in 
[link]. There, we introduced a number J that was a root of the polynomial 
x? + 1. We did this in order to build a number system in which negative 
numbers would have square roots. We adjoined the “number” 7 to the set of 
real numbers to form the set of complex numbers, and we then saw that in 
fact every complex number z has a square root. However, a fear was that, in 
order to build a system in which every number has an nth root for every n, 
we would continually need to be adjoining new elements to our number 
system. However, the Fundamental Theorem of Algebra shows that this is 
not necessary. The set of complex numbers is already rich enough to 
contain all nth roots and even more. 


Practically the same argument as in the preceding proof establishes another 
striking result. 
Liouville 


Suppose f is a bounded, everywhere differentiable function of a complex 
variable. Then f must be a constant function. 
Exercise: 


Problem: Prove Liouville's Theorem. 


The Maximum Modulus Principle 

Our next goal is to examine so-called “max/min” problems for coplex- 
valued functions of complex variables. Since order makes no sense for 
complex numbers, we will investigate max/min problems for the absolute 
value of a complex-valued function. For the corresponding question for 
real-valued functions of real variables, we have as our basic result the First 
Derivative Test. Indeed, when searching for the poinhts where a 
differentiable real-valued function f on an interval |a, 6] attains its extreme 
values, we consider first the poinhts where it attains a local max or min. Of 
course, to find the absolute minimum and maximum, we must also check 
the values of the function at the endpoints. 


Our next goal is to examine so-called “max/min” problems for coplex- 
valued functions of complex variables. Since order makes no sense for 
complex numbers, we will investigate max/min problems for the absolute 
value of a complex-valued function. For the corresponding question for 
real-valued functions of real variables, we have as our basic result the First 
Derivative Test ((link]). Indeed, when searching for the poinhts where a 
differentiable real-valued function f on an interval |a, 6] attains its extreme 
values, we consider first the poinhts where it attains a local max or min, to 
which purpose end [link] is useful. Of course, to find the absolute minimum 
and maximum, we must also check the values of the function at the 
endpoints. 


An analog of [link] holds in the complex case, but in fact a much different 
result is really valid. Indeed, it is nearly impossible for the absolute value of 
a differentiable function of a complex variable to attain a local maximum or 
minimum. 


Let f be a continuous function on a piecewise smooth geometric set S, and 
assume that f is differentiable on the interior S° of S. Suppose c is a point 
in S° at which the real-valued function | f| attains a local maximum. That 
is, there exists an € > 0 such that |f(c)| > |f(z)| for all z satisfying 

|}z — c| < e. Then f is a constant function on S; i.e., f(z) = f(c) for all 

z € S. In other words, the only differentiable functions of a complex 
variable, whose absolute value attains a local maximum on the interior of a 
geometric set, are constant functions on that set. 


If f(c) = 0, then f(z) = 0 for all z € B, (c). Hence, by the Identity 
Theorem ((link]), f(z) would equal 0 for all z € S. so, we may as well 
assume that f(c) 4 0. Let r be any positive number for which the closed 
disk B, (c) is contained in B, (c). We claim first that there exists a point z 
on the boundary C’, of the disk B,. (c) for which | f(z)| = |f(c)|. Of 
course, | f(z| < |f(c)| for all z on this boundary by assumption. By way of 
contradiction, suppose that | f(¢)| < |f(c)| for all ¢ on the boundary C,, of 
the disk. Write M/ for the maximum value of the function | f| on the 
compact set C’,. Then, by our assumption, M < |f(c)|. Now, we use the 
Cauchy Integral Formula: 

Equation: 
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and this is a contradiction. 


Now for each natural number n for which 1/n < e, let z,, be a point for 
which |z, — c| = 1/n and |f (z,)|= |f (c)|. We claim that the derivative 
f' (Zn) of f at 2, = 0 for all n. What we know is that the real-valued 
function F (x, y) = |f (x + iy)|’= (u(a,y)? + (v(a, y))? attains a local 
maximum value at Zz, = (%n, Yn). Hence, by [link], both partial derivatives 
of F' must be 0 at (an, yn). That is 

Equation: 


tial tial 
and 
Equation: 
tialu tialv 
2u (a5. Yn) a (Ln; Yn) + 2v (2, Yn) a (Ss Yn) = 0. 
tialy tialy 
Hence the two vectors 
Equation: 
a tialu tialv 
V, = : ( Daatin)s —— (Zn, Yn) 
tialx tialx 
and 
Equation: 
—> tialu tialv 
Vo = : (fas ta); —— (Ln, Yn) 
tialy tialy 


— 
are both perpendicular to the vector V3 = (uw (2p, Yn); U(2n5 Yn))- But 


> > > > 
V3 4 0, because || V3 ||= |f (zn)|= |f (c)|> 0, and hence V; and V2 are 
linearly dependent. But this implies that f’ (z,) = 0, according to [link]. 


Since c =lim z,, and f’ is analytic on 9°, it follows from the Identity 
Theorem that there exists an r > 0 such that f’ (z) = 0 for all z € B, (c). 
But this implies that f is a constant f(z) = f(c) forall z € B, (c). And 
thenm, again using the Identity Theorem, this implies that f(z) = f(c) for 
all z € S, which completes the proof. 


REMARK Of course, the preceding proof contains in it the verification 
that if | f| attains a maximum at a point c where it is differentiable, then 


f' (c) = 0. This is the analog for functions of a complex variable of [link]. 
But, [link] certainly asserts a lot more than that. In fact, it says that it is 
impossible for the absolute value of a nonconstant differentiable function of 
a complex variable to attain a local maximum. Here is the coup d'gras: 
Maximum Modulus Principle 


Let f be a continuous, nonconstant, complex-valued function on a 
piecewise smooth geometric set S, and suppose that f is differentiable on 
the interior S° of S'. Let M be the maximum value of the continuous, real- 
valued function | f| on S, and let z be a point in S for which |f(z)| = M. 
Then, z does not belong to the interior S° of S; it belongs to the boundary 
of S. In other words, | f| attains its maximum value only on the boundary of 
S. 


Exercise: 


Problem: 


a. Prove the preceding corollary. 

b. Let f be an analytic function on an open set U, and letc € U bea 
point at which | f| achieves a local minimum; i.e., there exists an 
€ > O such that | f(c)| < |f(z)| for all z © B, (c). Show that, if 
f(c) # 0, then f is constant on B, (c). Show by example that, if 
f(c) = 0, then f need not be a constant on B, (c). 

c. Prove the “Minimum Modulus Principle:” Let f be a nonzero, 
continuous, nonconstant, function on a piecewise smooth 
geometric set S, and let m be the minimum value of the function 
|f| on S. If z is a point of S$ at which this minimum value is 
atgtained, then z belongs to the boundary C's of S. 


The Open Mapping Theorem and the Inverse Function Theorem 

We turn next to a question about functions of a complex variable that is related to the 
Inverse Function Theorem. That result asserts, subject to a couple of hypotheses, that the 
inverse of a one-to-one differentiable function of a real variable is also differentiable. 
Since a function is only differentiable at points in the interior of its domain, it is 
necessary to verify that the point f(c) is in the interior of the domain f(S) of the inverse 
function f ~! before the question of differentiability at that point can be addressed. And, 
the peculiar thing is that it is this point about f(c) being in the interior of f(S) that is the 
subtle part. The fact that the inverse function is differentiable there, and has the 
prescribed form, is then only a careful « — 6 argument. For continuous real-valued 
functions of real variables, the fact that f(c) belongs to the interior of f(.S) boils down to 
the fact that intervals get mapped onto intervals by continuous functions, which is 
basically a consequence of the Intermediate Value Theorem. However, for complex- 
valued functions of complex variables, the situation is much deeper. For instance, the 
continuous image of a disk is just not always another disk, and it may not even be an 
open set. Well, all is not lost; we just have to work a little harder. 


We turn next to a question about functions of a complex variable that is related to [link], 
the Inverse Function Theorem. That result asserts, subject to a couple of hypotheses, that 
the inverse of a one-to-one differentiable function of a real variable is also differentiable. 
Since a function is only differentiable at points in the interior of its domain, it is 
necessary to verify that the point f(c) is in the interior of the domain f(S) of the inverse 
function f —! before the question of differentiability at that point can be addressed. And, 
the peculiar thing is that it is this point about f(c) being in the interior of f(S) that is the 
subtle part. The fact that the inverse function is differentiable there, and has the 
prescribed form, is then only a careful « — 6 argument. For continuous real-valued 
functions of real variables, the fact that f(c) belongs to the interior of f(S) boils down to 
the fact that intervals get mapped onto intervals by continuous functions, which is 
basically a consequence of the Intermediate Value Theorem. However, for complex- 
valued functions of complex variables, the situation is much deeper. For instance, the 
continuous image of a disk is just not always another disk, and it may not even be an 
open set. Well, all is not lost; we just have to work a little harder. 

Open Mapping Theorem 


Let S' be a piecewise smooth geometric set, and write U for the (open) interior S° of S. 
Suppose f is a nonconstant differentiable, complex-valued function on the set U. Then 
the range f(U)of f is an open subset of C. 


Let c be in U. Because f is not a constant function, there must exist an r > O such that 
f(c) 4 f(z) for all z on the boundary C, of the disk B, (c). See part (b) of [link]. Let zo 
be a point in the compact set C’,. at which the continuous real-valued function 

| f(z) — f(c)| attains its minimum value s. Since f(z) # f(c) for any z € C,, we must 
have that s > 0. We claim that the disk Bs/2 (f (c)) belongs to the range f(U) of f. This 


will show that the point f(c) belongs to the ihnterior of the set f(U), and that will finish 
the proof. 


By way of contradiction, suppose B,/2(f (c) is not contained in f(U),, and let 
w © Bs/2 (f (c)) be a complex number that is not in f(U). We have that 
|w — f(c)| < s/2, which implies that |w— f(z)| > s/2 for all z € C;. Consider the 
function g defined on the closed disk B, (c) by g(z) = 1/(w — f(z)). Then g is 
continuous on the closed disk B, (c) and differentiable on B, (c). Moreover, g is not a 
constant function, for if it were, f would also be a constant function on B,. (c) and 
therefore, by the Identity Theorem, constant on all of U, whichg is not the case by 
hypothesis. Hence, by the Maximum Modulus Principle, the maximum value of |g| only 
occurs on the boundary C,, of this disk. That is, there exists a point z’ € C,, such that 
lg (z)| < |g (2’)| for all z € B, (c). But then 
Equation: 
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s 8/2 |w—f(e)| lw flz)| ~ s 
which gives the desired contradiction. Therefore, the entire disk Bs/2 (f (c)) belongs to 


f(U), and hence the point f(c) belongs to the interior of the set f(U). Since this holds 
for any point c € U, it follows that f(U) is open, as desired. 


Now we can give the version of the Inverse Function Theorem for complex variables. 


Let S be a piecewise smooth geometric set, and suppose f : S — C is continuously 
differentiable at a point c = a + bi, and assume that f’ (c) 4 0. Then: 


1. There exists an r > 0, such that B, (c) C S, for which f is one-to-one on B, (c). 

2. f(c) belongs to the interior of f(S). 

3. If g denotes the restriction of the function f to B, (c), then g is one-to-one, g~? is 
differentiable at the point f(c), and g-! (f (c) =1/f' (c). 


Arguing by contradiction, suppose that f is not one-to-one on any disk B, (c). Then, for 
each natural number n, there must exist two points 2, = yp +7Yp and 2), = x), + iy!, 
such that |z, — c| < 1/n,|z}, — c| < 1/n, and f (zn) = f (z),). If we write f = u+ iv, 
then we would have that u (tn, Yn) — u(x}, y;,) = 0 for all n. So, by part (c) of [link], 
there must exist for each n a point (ee els such that (Pas In) is on the line segment 
joining z, and z},, and for which 

Equation: 
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Similarly, applying the same kind of reasoning to v, there must exist points (Z,, J) on 
the segment joining z,, to z/, such that 
Equation: 
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If we define vectors U,, and V,, by 


Equation: 
=> tialu tialu 
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and 
Equation: 


> —: 
then we have that both U,, and V,, are perpendicular to the nonzero vector 


= — 
((an — 2),), (Yn — y},))-. Therefore, U,, and V,, are linearly dependent, whence 
Equation: 


tialu /,. ,\tialu, . ,tialv,_ _,tialv,_ _ 
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Now, since both {%,, + ig, } and {%, + 1G} converge to the point c = a + ib, and the 
partial derivatives of u and v are continuous at c, we deduce that 
Equation: 
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Now, from [link], this would imply that f ' (c) = Q, and this is a contradiction. Hence, 
there must exist an r > 0 for which f is one-to-one on B,. (c), and this proves part (1). 


Because f is one-to-one on B, (c),f is obviously not a constant function. So, by the 
Open Mapping Theorem, the point f(c) belongs to the interior of the range of f, and this 
proves part (2). 


Now write g for the restriction of f to the disk B, (c). Then g is one-to-one. According 
to part (2) of [link], we can prove that g~* is differentiable at f (c) by showing that 
Equation: 


ao ~=«e=1@) Fo) 


That is, we need to show that, given an € > 0, there exists a 6 > O such that if 
0 < |z— f(c)| < 6 then 
Equation: 


g*(z)-g"*(f(o)) 1 
z— f(c) f'(¢) 


|<. 


First of all, because the function 1/w is continuous at the point f’ (c), there exists an 
e’ > O such that if |w—f’ (c)| < é’, then 
Equation: 


1 1 


Next, because f is differentiable at c, there exists a 6’ > 0 such that if 0 < |y—c| < 6’ 
then 
Equation: 


MYAKO _ pgi<e, 


Now, by [link], g~/ is continuous at the point f (c), and therefore there exists ad > 0 
such that if |z — f(c)| < 6 then 
Equation: 


g'(z)-g' F(ll <&. 


So, if |z — f(c)| < 6, then 
Equation: 


g (z)-e=l9* (2) -9 (FO) <%. 


But then, 
Equation: 


from which it follows that 
Equation: 


as desired. 


Uniform Convergence of Analytic Functions 
A brief module containing a theorem about uniform convergence of analytic 
functions. 


Part (c) of [link] gives an example showing that the uniform limit of a 
sequence of differentiable functions of a real variable need not be 
differentiable. Indeed, when thinking about uniform convergence of 
functions, the fundamental result to remember is that the uniform limit of 
continuous functions is continuous ({link]). The functions in [link] were 
differentiable functions of a real variable. The fact is that, for functions of a 
complex variable, things are as usual much more simple. The following 
theorem is yet another masterpiece of Weierstrass. 


Suppose U is an open subset of C’, and that {f,,} is a sequence of analytic 
functions on U that converges uniformly to a function f. Then f is analytic 
on U. That is, the uniform limit of differentiable functions on an open set U 
in the complex plane is also differentiable on U. 


Though this theorem sounds impressive and perhaps unexpected, it is really 
just a combination of [link] and the Cauchy Integral Formula. Indeed, let c 
be a point in U, and let r > 0 be such that B, (c) C U. Then the sequence 
{ fn} converges uniformly to f on the boundary C;, of this closed disk. 
Moreover, for any z € B,(c), the sequence { f, (¢)/ (¢ — z)} converges 
uniformly to f(¢)/(¢ — z) on C;,. Hence, by [link], we have 

Equation: 


f(z) = lim f, (2) 
1 fn (Q) 
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Hence, by part (a) of [link], f is expandable in a Taylor series around c, i.e., 
f is analytic on U. 


Isolated Singularities, and the Residue Theorem 
this module covers the residue theorem, removable and isolated singularities, some new 
developments from Cauchy's theorem, and related exercises. 


The first result we present in this section is a natural extension of [link]. However, as we 
shall see, its consequences for computing contour integrals can hardly be overstated. 


Let S be a piecewise smooth geometric set whose boundary Cg has finite length. Suppose 
C1, «++, Cn are distinct points in the interior S° of S, and that 71, ..., 7°, are positive numbers 


such that the closed disks {Bry (cx) } are contained in S° and pairwise disjoint. Suppose f 


is continuous on S \ UB,., (cx), i.e., at each point of S that is not in any of the open disks 
B,, (cz), and that f is differentiable on S° \ UB,,, (cg), i-e., at each point of S° that is not 
in any of the closed disks B,., (cz). Write C%, for the circle that is the boundary of the closed 
disk B,, (cx). Then 

Equation: 
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This is just a special case of part (d) of [link]. 


Let f be continuous on the punctured disk B’, (c), analytic at each point z in BY (c), and 
suppose f is undefined at the central point c. Such points c are called isolated singularities 
of f, and we wish now to classify these kinds of points. Here is the first kind: 


A complex number c is called a removable singularity of an analytic function f if there 
exists an r > O such that f is continuous on the punctured disk B’, (c), analytic at each 
point in Bi. (c), and lim,,. f (z) exists. 


Exercise: 


Problem: 


a. Define f(z) =sin z/z for all z 4 0. Show that 0 is a removable singularity of f. 

b. For z # c, define f(z) = (1— cos (z — c))/(z — c). Show that c is a removable 
singularity of f. 

c. For z # c, define f (z) = (1— cos (z — c))/(z — c)”. Show that c is still a 
removable singularity of f. 

d. Let g be an analytic function on B, (c), and set f(z) = (g(z) — g(c))/(z — ¢) for 
all z € Bi, (c). Show that c is a removable singularity of f. 


The following theorem provides a good explanation for the term “removable singularity.” 
The idea is that this is not a “true” singularity; it's just that for some reason the natural 
definition of f at c has not yet been made. 


Let f be continuous on the punctured disk BY, (c) and differentiable at each point of the open 
punctured disk By (c), and assume that c is a removable singularity of f. Define f by 
f (z) =f (2) forall z € B’ (c), and f (c) =lim,_,, f (z). Then 


1. f is analytic on the entire open disk B, (c), whence 
Equation: 


for all z € Bi, (c). 

2. For any piecewise smooth geometric set S C B, (c), whose boundary Cg has finite 
length, and for which c € 9°, 
Equation: 


f(¢) d¢ = 0. 


Cs 


As in part (a) of [link], define F' on B, (c) by 
Equation: 


fgcl.f 20 


Ont C2 


dc. 


Then, by that exercise, F’ is analytic on B, (c). We show next that F(z) = f(z) on B; (c), 
and this will complete the proof of part (1). 


Let z be a point in B, (c) that is not equal to c, and let € > 0 be given. Choose 6 > 0 such 
that 6 < |z—c|/2andsuch that f(¢) — f(c)| < if |¢ — c| < 6. Then, using part (c) of 


[link], we have that 
Equation: 
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where the last equality holds because the function f (c)/ (¢ — z) is an analytic function of ¢ 
on the disk B; (c), and hence the integral is 0 by [link]. So, 


Equation: 
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Since this holds for arbitrary « > 0, we see that f (z) = F(z) for all z 4 cin B, (c). 


Finally, since 
Equation: 


f (c) =lim f (z) =lim z > cF (z) = F(c), 


2—>C 


the equality of F’ and fon all of B, (c) is proved. This finishes the proof of part (1). 
Exercise: 


Problem: Prove part (2) of the preceding theorem. 


Now, for the second kind of isolated singularity: 


A complex number c is called a pole of a function f if there exists an r > O such that f 
is continuous on the punctured disk B’, (c), analytic at each point of B/. (c), the point c 
is not a removable singularity of f, and there exists a positive integer k such that the 
analytic function (z — c)*f (z) has a removable singularity at c. 


A pole c of f is said to be of order n, if n is the smallest positive integer for which the 
function f (z) = (z—c)"f (z) has a removable singularity at c. 


Exercise: 


Problem: 


a. Let c be a pole of order n of a function f, and write f (z) = (z — c)"f (z). Show 


that f is analytic on some disk B, (c). 
b. Define f (z) =sin z/z? for all z 4 0. Show that 0 is a pole of order 2 of f. 


Let f be continuous on a punctured disk B’, (c), analytic at each point of Bj, (c), and 
suppose that c is a pole of order n of f. Then 


1. For all z € Bi, (c), 
Equation: 
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2. The infinite series of part (1) converges uniformly on each compact subset K of BY’ (c). 
3. For any piecewise smooth geometric set S C B, (c), whose boundary Cg has finite 
length, and satisfying c € S°, 
Equation: 
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where A_, is the coefficient of (z — c)' in the series of part (1). 


For each z € B'.(c), write f (z) = (z —c)"f (z). Then, by [link], f is analytic on B, (c), 
whence 
Equation: 


(z=0)" 4 ; 
= S\ a{z-9F, 
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where ag = Cn+k- This proves part (1). 


We leave the proof of the uniform convergence of the series on each compact subset of 
BY (c), i.e., the proof of part (2), to the exercises. 


Part (3) follows from Cauchy's Theorem ([link]) and the computations in [link]. Thus: 
Equation: 
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as desired. The summation sign comes out of the integral because of the uniform 
convergence of the series on the compact circle C’,. 
Exercise: 


Problem: 


a. Complete the proof to part (2) of the preceding theorem. That is, show that the 
infinite series 77°, ax(z — c)* converges uniformly on each compact subset A’ 
of Bi. (c). HINT: Use the fact that the Taylor series )7°°_ ¢n(z — ¢)” for f 
converges uniformly on the entire disk B, (c), and that if c is not in a compact 
subset K of B, (c), then there exists a d > 0 such that |z — c| > 6 forall z € K. 


b. Let f,c, and f be as in the preceding proof. Show that 
Equation: 


c. Suppose g is a function defined on a punctured disk BY. (c) that is given by the 
formula 
Equation: 


[) 
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for some positive integer n and for all z € BY’. (c). Suppose in addition that the 
coefficient a_, # 0. Show that c is a pole of order n of g. 


Having defined two kinds of isolated singularities of a function f, the removable ones and 
the polls of finite order, there remain all the others, which we collect into a third type. 


Let f be continuous on a punctured disk B’, (c), and analytic at each point of BY’ (c). 
The point c is called an essential singularity of f if it is neither a removable singularity 
nor a poll of any finite order. Singularities that are either poles or essential singularities 
are called nonremovable singularities. 


Exercise: 


Problem: For z # 0, define f (z) = e!/%. Show that 0 is an essential singularity of f. 


Let f be continuous on a punctured disk B’, (c), analytic at each point of Bi. (c), and 
suppose that c is an essential singularity of f. Then 


il 


For all z € Bi. (c), 
Equation: 
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where the sequence {a,}°. has the property that for any negative integer N there is a 
k < N such that ax + 0. 


. The infinite series in part (1) converges uniformly on each compact subset K of BY. (c). 


That is, if F, is defined by F,, (z) = S7%__,, ax(z — c)*, then the sequence {F,,} 
converges uniformly to f on the compact set K. 


. For any piecewise smooth geometric set S C B, (c), whose boundary Cg has finite 


length, and satisfying c € S°, we have 
Equation: 


+ (Chae = Bria 4 
Cs 


where a_ is the coefficient of (z — c)’ in the series of part (1). 


Define numbers {a,}°°. as follows. 
Equation: 


_ i (Q) 
= 277 [, Coo dc. 


Note that for any 0 < 6 < r we have from Cauchy's Theorem that 
Equation: 


ol f(¢) 


where C's denotes the boundary of the disk B; (c). 


Let z  cbe in B, (c), and choose 6 > 0 such that 6 < |z — c|. Then, using part (c) of 
[link], and then mimicking the proof of [link], we have 
Equation: 


1 1, 1 f £0 
f2) = 27% [ ee 27% [ ban 


4 #(0) 1 #(0) 
= ae ae (@-)-(C-2) 


1 fic) 1 1 f(Q) 1 de 


oo gos 
2mt Jo, G-c 1—- #£ oT aa C2 Se f= 
Z—C 


c = 
_— 1 f £0 S(z-¢\' 1 ¢ £0 S(o-e)’ 
- Pa ee ae) i 


“1 FQ = 
_ 2 ae e —~ d ¢(z—c) ee a f(¢ (¢ —c)/ dt(z—c) —j-l 


k=0 —c) 

y my £0) ; 
~ ae On, d((z—c 

D4 k( ) a 271 is (cag C( ) 

y Ky £Q) ; 
= an(z—c = dtlz—e 

de k( ) + De 271 [, (Caer cal ) 


which proves part (1). 


We leave the proofs of parts (2) and (3) to the exercises. 
Exercise: 


Problem: 


a. Justify bringing the summation signs out of the integrals in the calculation in the 
preceding proof. 
b. Prove parts (2) and (3) of the preceding theorem. Compare this with [link]. 


REMARK The representation of f(z) in the punctured disk By, (c) given in part (1) of [link] 
and [link] is called the Laurent expansion of f around the singularity c. Of course it differs 
from a Taylor series representation of f, as this one contains negative powers of z — c. In 
fact, which negative powers it contains indicates what kind of singularity the point c is. 


Non removable isolated singularities of a function f share the property that the integral of f 
around a disk centered at the singularity equals 272a_;, where the number a_, is the 
coefficient of (z — c) in the Laurent expansion of f around c. This number 2772a_, is 
obviously significant, and we call it the residue of f at c, and denote it by R+(c). 


Combining [link], [link], and [link], we obtain: 
Residue Theorem 


Let S be a piecewise smooth geometric set whose boundary has finite length, let c1,...,¢, be 
points in S°, and suppose f is a complex-valued function that is continuous at every point z 
in S except the c;'s, and differentiable at every point z € S° except at the c,'s. Assume 
finally that each cz is a nonremovable isolated singularity of f. Then 

Equation: 


n 
f(C) dC = S7 Rs (cr). 
Cs k=1 
That is, the contour integral around C’g is just the sum of the residues inside S. 
Exercise: 


Problem: Prove [link]. 


Exercise: 


Problem: 


Use the Residue Theorem to compute f Cs f (¢) dé for the functions f and geometric 


sets S' given below. That is, determine the poles of f inside S, their orders, the 
corresponding residues, and then evaluate the integrals. 


z) =sin (32) /z?, and)S =.B, (0). 
1) = a and S = B, (0). 
) =el/” and S = B; (0). 


a : ee ee —1)), and S = B, (0). 
efi (1 — 27) /z(1+27)(22+ Nae and S = Bz (0). 
f. f(z) =1/ (1+ 2) = (1/ (2 - 4) (2 +4), and S = B, (0) foranyr > 1. 


The Residue Theorem, a result about contour integrals of functions of a complex variable, 
can often provide a tool for evaluating integrals of functions of a real variable. 


Example: 
Consider the integral 
Equation: 


te 1 
/ Seah 
_« l+24 


Let us use the Residue Theorem to compute this integral. 
Of course what we need to compute is 


Equation: 
B 
1 
lim / eee 


The first thing we do is to replace the real variable x by a complex variable Z, and observe 
that the function f (z) = 1/ (al + Z°) is analytic everywhere except at the four points 
+e’"/4 and +e'7/*, See part (f) of the preceding exercise. These are the four points whose 
fourth power is —1, and hence are the poles of the function f. 

Next, given a positive number B, we consider the geometric set (rectangle) S'g that is 
determined by the interval |—B, B] and the two bounding functions J(x) = 0 and 

u(x) = B. Then, as long as B > 1, we know that f is analytic everywhere in S° except at 
the two points c, = e’”/4 and cy = e**”/4, so that the contour integral of f around the 
boundary of Sg is given by 

Equation: 


1 
[ho Tega = Pele) + Relea) 


Now, this contour integral consists of four parts, the line integrals along the bottom, the two 
sides, and the top. The magic here is that the integrals along the sides, and the integral along 
the top, all tend to 0 as B tends to infinity, so that the integral along the bottom, which after 
all is what we originally were interested in, is in the limit just the sum of the residues inside 
the geometric set. 


Exercise: 


Problem: Verify the details of the preceding example. 


a. Show that 
Equation: 
ae 1 
lim Se 
Bow Jo 14+(B+it) 


b. Verify that 


Equation: 
: 1 
lim ri dt = 0. 
Bow J_p1+(t+iB) 
c. Show that 
Equation: 


a 1 
i 7 da = V2. 


o l+2 


Methods similar to that employed in the previous example and exercise often suffice to 
compute integrals of real-valued functions. However, the method may have to be varied. For 
instance, sometimes the appropriate geometric set is a rectangle below the z-axis instead of 
above it, sometimes it should be a semicircle instead of a rectangle, etc. Indeed, the choice of 
contour (geometric set) can be quite subtle. The following exercise may shed some light. 
Exercise: 


Problem: 


a. Compute 


Equation: 


and 
Equation: 
(oe) er 
/ ——— 
_o 1+24 
b. Compute 
Equation: 
°° sin (—2 
/ (—z) 4 
_o 1+23 
and 
Equation: 
oo sin x 
i d 
_o 1l+23 
Example: 


An historically famous integral in analysis is f Ese sin «/a dx. The techniques described 
above don't immediately apply to this function, for, even replacing the zx by a z, this 
function has no poles, so that the Residue Theorem wouldn't seem to be much help. Though 
the point 0 is a singularity, it is a removable one, so that this function sin z/z is essentially 
analytic everywhere in the complex plane. However, even in a case like this we can obtain 
information about integrals of real-valued functions from theorems about integrals of 
complex-valued functions. 

Notice first that f°. sin x/zx dz is the imaginary part of [°° e’”/x dz, so that we may as 
well evaluate the integral of this function. Let f be the function defined by f (z) = e’*/z, 
and note that 0 is a pole of order 1 of f, and that the residue R+ (0) = 27%. Now, for each 
B > Oand6 > 0 define a geometric set $3.5, determined by the interval |—B, B], as 
follows: The upper bounding function wz is given by ups (x) = B, and the lower 
bounding function Jz 5 is given by lg5 (x) = 0 for —B< x < —dandé < a < B,and 
‘Bees Re de'™/9 for —§ < x < 6. Thatis, S$ B,6 is just like the rectangle Sz in Example 1 
above, except that the lower boundary is not a straight line. Rather, the lower boundary is a 


straight line from —B to —d, a semicircle below the x-axis of radius 6 from —6 to 6, anda 
straight line again from 6 to B. 

By the Residue Theorem, the contour integral 

Equation: 


f (¢) d¢ = Ry (0) = 2m. 


As in the previous example, the contour integrals along the two sides and across the top of 
S'p.6 tend to 0 as B tends to infinity. Finally, according to part (e) of [link], the contour 
integral of f along the semicircle in the lower boundary is 77 independent of the value of 6. 
So, 


Equation: 
e's 

lim lim / — dG =m, 

Bowd>0 graph(Ip,s) G 
implying then that 
Equation: 

Sie 
if do =r 
Sou ee. 
Exercise: 
Problem: 


a. Justify the steps in the preceding example. In particular, verify that 


Equation: 
B e( Brit) 
lim / — dt = 0, 
Boo Jog B+it 
Equation: 
B eilt+iB) 
lim — dt = 0, 
and 
Equation: 


where C's is the semicircle of radius 6, centered at the origin and lying below the x 
-axis. 

b. Evaluate 
Equation: 


Appendix: Existence and Uniqueness of a Complete Ordered Field 

This appendix is devoted to the proofs of Theorems 1 and 2 in "The Real and 
Complex Numbers: The Real Numbers," which together assert that there exists a 
unique complete ordered field. Our construction of this field will follow the ideas of 
Dedekind, which he presented in the late 1800's. 


This appendix is devoted to the proofs of [link] and [link], which together assert that 
there exists a unique complete ordered field. Our construction of this field will 
follow the ideas of Dedekind, which he presented in the late 1800's. 


By a Dedekind cut, or simply a cut, we will mean a pair (A, B) of nonempty 
(not necessarily disjoint) subsets of the set Q of rational numbers for which the 
following two conditions hold. 


1. AU B= Q. That is, every rational number is in one or the other of these 
two sets. 

2. For every element a € A and every element b € B,A < b. That is, every 
element of A is less than or equal to every element of B. 


Recall that when we define the rational numbers as quotients (ordered pairs) of 
integers, we faced the problem that two different quotients determine the same 
rational number, e.g., 2/3 = 6/9. There is a similar equivalence among Dedekind 
cuts. 


Two Dedekind cuts (A,, b;) and (Az, Bg) are called equivalent if ay < by for 
all a; € A, and all b2 € Bo, and az < 0, forall ag € Ag and all b} € By. In 
such a case, we write (A;, By) = (Ao, Bo). 


Exercise: 


Problem: 


a. Show that every rational number r determines three distinct Dedekind cuts 
that are mutually equivalent. 

b. Let B be the set of all positive rational numbers r whose square is greater 
than 2, and let A comprise all the rationals not in B. Prove that the pair 
(A, B) is a Dedekind cut. Do you think this cut is not equivalent to any cut 
determined by a rational number r as in part (a)? Can you prove this? 

c. Prove that the definition of equivalence given above satisfies the three 
conditions of an equivalence relation. Namely, show that 


i. (Reflexivity) (A, B) is equivalent to itself. 
ii. (Symmetry) If (Ai, B1) = (Ag, Ba), then (Az, Be) = (A1, B:). 


iii. (Transitivity) If (Ai, Bi) = (Ae, Bz) and (A2, Bz) = (A3, Bs), 
then (Ai, B;) = (Az, B3). 


There are three relatively simple-sounding and believable properties of cuts, and we 
present them in the next theorem. It may be surprising that the proof seems to be 
more difficult than might have been expected. 


Let (A, B) be a Dedekind cut. Then 


1.Ifa € Aanda’ <a, thena’ € A. 

2.1fb € Band b’ > b, thend’ € B. 

3. Let € be a positive rational number. Then there exists ana € A andabe B 
such that b—a < e. 


Suppose a is an element of A, and let a’ < a be given. By way of contradiction 
suppose that a’ does not belong to A. Then, by Condition (1) of the definition of a 
cut, it must be that a’ € B. But then, by Condition (2) of the definition of a cut, we 
must have that a < a’, and this is a contradiction, because a’ < a. This proves part 
(1). Part (2) is proved in a similar manner. 


To prove part (3), let the rational number € > 0 be given, and set r = €/2. Choose 
an element ap € A and an element bp € B. Such elements exist, because A and B 
are nonempty sets. Choose a natural number N such that aj + Nr > bo. Sucha 
natural number NV must exist. For instance, just choose N to be larger than the 
rational number (bo — ao) /r. Now define a sequence {a;} of rational numbers by 
Q~ = ag + kr, and let K be the first natural number for which ax € B. Obviously, 
such a number exists, and in fact K must be less than or equal to NV. Now, ax_, is 
not in B, so it must be in A. Seta = Ax_, andb= Ax. Clearly, a € A,b € B, and 
Equation: 


b—a=ax ~ ax; = a9 + Kr—ag~(K-l)r=r=> <e, 


and this proves part (3). 


We will make a complete ordered field F' whose elements are the set of equivalence 
classes of Dedekind cuts. We will call this field the Dedekind field. To make this 
construction, we must define addition and multiplication of equivalence classes of 
cuts, and verify the six required field axioms. Then, we must define the set P that is 
to be the positive elements of the Dedekind field F’, and then verify the required 


properties of an ordered field. Finally, we must prove that this field is a complete 
ordered field; i.e., that every nonempty set that is bounded above has a least upper 
bound. First things first. 


If (Ai, Bi) and (Ag, Bz) are Dedekind cuts, define the sum of (A, Bi) and 
(Az, Bz) to be the cut (A3, B3) described as follows: B3 is the set of all 
rational numbers bs that can be written as b; + by for some 6; € By, and 


by € Bo, and As is the set of all rational numbers r such that r < 63 for all 
bz € Bg. 


Several things need to be checked. First of all, the pair (A3, B3) is again a Dedekind 
cut. Indeed, it is clear from the definition that every element of Ag is less than or 
equal to every element of Bs, so that Condition (2) is satisfied. To see that Condition 
(1) holds, let r be a rational number, and suppose that it is not in A3. We must show 
that r belongs to Bs. Now, since r ¢ A3, there must exist an element 

bs = b; + by € Bs for which r > b3. Otherwise, r would be in A3. But this means 
that r — bz > by, and so by part (2) of [link], we have that r — bz is an element b‘, of 
B,. Therefore, r = 6), + bg, implying that r € Bs, as desired. 


We define the 0 cut to be the pair Ag = {r: r < 0} and Bo = {r: r > O}. This cut 
is one of the three determined by the rational number 0. 
Exercise: 


Problem: 


a. Prove that addition of Dedekind cuts is commutative and associative. 

b. Prove that if (Ai, B,) = (C1, D;) and (Ag, Bz) = (Co, Dg), then 
(Ai, Bi) + (Ao, Ba) = (Ci, D1) + (C2, D2). 

c. Find an example of a cut (A, B) such that (A, B) + 0 4 (A, B). 

d. Prove that (A, B) + 0 = (A, B) for every cut (A, B). 


We define addition in the set F' of all equivalence classes of Dedekind cuts as 
follows: 


If x is the equivalence class of a cut (A, b) and y is the equivalence class of a 
cut (C, D), then x + y is the equivalence class of the cut (A, B) + (C, D). 


It follows from the previous exercise, that addition in F' is well-defined, 
commutative, and associative. We are on our way. 


We define the element 0 of F' to be the equivalence class of the 0 cut. The next 
theorem establishes one of the important field axioms for F’, namely, the existence of 
an additive inverse for each element of F’. 


If (A, B) is a Dedekind cut, then there exists a cut (A’, B’) such that 
(A, B) + (A’, B’) is equivalent to the 0 cut. Therefore, if x is an element of F’, then 
there exists an element y of F' such that x + y = 0. 


Let A’ = —B, i.e., the set of all the negatives of the elements of B, and let 

B' = —A, i.e., the set of all the negatives of the elements of A. It is immediate that 
the pair (A’, B’) is a Dedekind cut. Let us show that (A, B) + (A’, B’) is equivalent 
to the zero cut. Let (C', D) = (A, B) + (A’, B’). Then, by the definition of the sum 
of two cuts, we know that D consists of all the elements of the form 

d=b+b' =b—a, whereb € Banda € A. Sincea < b foralla € Aandbe B, 
we see then that the elements of D are all greater than or equal to 0. To see that 

(C, D) is equivalent to the 0 cut, it will suffice to show that D contains all the 
positive rational numbers. (Why?) Hence, let € > 0 be given, and choose ana € A 
and ab € Bsuch that b — a < e. This can be done by Condition (3) of [link]. Then, 
the number b — a € D, and hence, by part (2) of [link], e € D. It follows then that 
the cut (C’, D) is equivalent to the zero cut (Ao, Bo), as desired. 


We will write —(A, B) for the cut (A’, B’) of the preceding proof. 
Exercise: 
Problem: 
a. Suppose (A, B) is acut, and let (C’, D) be a cut for which 
(A, B) + (C, D) is equivalent to the 0 cut. Show that 
(C, D) = (A’, B’) = — (A, B). 


b. Prove that the additive inverse of an element z of the Dedekind field F is 
unique. 


The definition of multiplication of cuts, as well as multiplication in F’, is a bit more 
tricky. In fact, we will first introduce the notion of positivity among Dedekind cuts. 


A Dedekind cut x = (A, B) is called positive if A contains at least one positive 
rational number. 


Exercise: 


Problem: 


feb) 


io) 


. Suppose (A, B) and (C, D) are equivalent cuts, and assume that (A, B) is 


positive. Prove that (C’, D) also is positive. Make the obvious definition of 
positivity in the set F’. 


. Show that the sum of two positive cuts is positive. Conclude that the sum 


of two positive elements of F’, i.e., the sum of two equivalence classes of 
positive cuts, is positive. 


. Let (A, B) be a Dedekind cut. Show that one and only one of the 


following three properties holds for (A, B). (i) (A, B) is a positive cut, (ii) 
—(A, B) is a positive cut, or (iii) (A, B) is equivalent to the 0 cut. 


. Establish the law of tricotomy for F’ : That is, show that one and only one 


of the following three properties holds for an element x € F’. (i) x is 
positive, (ii) —z is positive, or (iii) c = 0. 


We first define multiplication of cuts when one of them is positive. 


Let (A1, B,) and (Az, Bz) be two Dedekind cuts, and suppose that one of 
these cuts is a positive cut. We define the product(A3, B3) of (A;, B,) and 
(Az, Bz) as follows: Set B3 equal to the set of all bs that can be written as b;b2 
for some b; € By, and by € Bo. Then set A3 to be all the rational numbers r for 
which r < bs for all b3 € Bg. 


Again, things need to be checked. 


Exercise: 
Problem: 

a. Show that the pair (A3, B3) of the preceding definition for the product of 
positive cuts is in fact a Dedekind cut. 

b. Prove that multiplication of Dedekind cuts, when one of them is positive, 
is commutative. 

c. Suppose (Aj, B) is a positive cut. Prove that 
Equation: 
(Ai, B1) ((A2, Be) + (As, Bs)) = (A1, Bi) (A2, Bo) + (Ar, B1) (Az, Bs) 
for any cuts (A2, Be) and (As3, Bs). 

d. Show that, if (A1, By) = (Ag, Bz) and (Ci, D1) = (C2, Dz) and 
(a,, B,) and (Ag, Bo) are positive cuts, then 
(Aj, B,) (Ci, D,) = (Ap, By) (C2, Dy). 

e. Show that the product of two positive cuts is again a positive cut. 


We are ready to define multiplication in F’. 
Let x and y be elements of F’. 


If either x or y is positive, define the product z x y to be the equivalence class 
of the cut (A, B)(C, D), where x is the equivalence class of (A, B) and y is 
the equivalence class of (C, D). 


If either x or y is 0, define x x y to be 0. 


If both x and y are negative, i.e., both —x and —y are positive, define 
xx y= (—a) x (—y). 


The next exercise is tedious. It amounts to checking a bunch of cases. 
Exercise: 


Problem: 


a. Prove that multiplication in F’ is commutative. 

b. Prove that multiplication in F’ is associative. 

c. Prove that multiplication in F’ is distributive over addition. 

d. Prove that the product of two positive elements of F' is again positive. 


We define the element 1 of F’ to be the equivalence class of the cut (Al, a), where 
Ab=adrir = lhand B= trie S 1}. 
Exercise: 


Problem: 


a. Prove that the elements 0 and 1 of F are not equal. 

b. Prove that x x 1 = z for every element x € F’. 

c. Use the associative law and part (b) to prove that if ry = 1 andzz=1, 
then y = z. 


With respect to the operations of addition and multiplication defined above, together 
with the definition of positive elements, F' is an ordered field. 


The first five axioms for a field, given in [link], have been established for F’ in the 
preceding exercises, so that we need only verify axiom 6 to complete the proof that 
F is a field. Thus, let z € F be a nonzero element. We must show the existence of an 
element y of F for which x x y = 1. Suppose first that x is a positive element of F’. 


Then z is the equivalence class of a positive cut (A, B), and therefore A contains 
some positive rational numbers. Let ap be a positive number that is contained in A. 
It follows then that every element of B is greater than or equal to ag and hence is 


positive. Define B to be the set of all rational numbers r for which r > 1 /b for every 
b € B. Then define A to be the set of all rationals r for which r < b for every 
b € B. It follows directly that the pair (4, B) is a Dedekind cut. 


Let (C, D) = (A, B) x (4, B), and note that every element d € D is of the form 


d = bb, and hence is greater than or equal to 1. We claim that (C, D) is equivalent to 
the cut (At, B') that determines the element 1 of F’. To see this we must verify that 
D contains every rational number r that is greater than 1. Thus, let r > 1 be given, 
and set € = ag (r — 1). From Condition (3) of [link], choose ana’ € Aandab’ € B 
such that b’ — a’ < e. Without loss of generality, we may assume that a’ > ap. 


Finally, set 6 = 1 /a’. Clearly 6 > 1 /b for all b € B, so that 6 € B. Also 
d= b'b€ D, and 
Equation: 


implying that r € D. Therefore, (C’, D) is equivalent to the cut (A1, B'), implying 
that (A, B) x (4, B) is equivalent to the cut (A’, B'). Therefore, if y is the 


element of F' that is the equivalence class of the cut (4, B) , then x x y= 1,as 


desired. 


If x is negative, then —z is positive. If we write z for the multiplicative inverse of 
the positive element —x, then —z is the multiplicative inverse of the element x. 
Indeed, by the definition of the product of two negative elements of F’, 

gx (—z) =(-#) x z=1. 


The properties that guarantee that F’ is an ordered field also have been established in 
the preceding exercises, so that the proof of this theorem is complete. 


So, the Dedekind field is an ordered field, but we have left to prove that it is 
complete. This means we must examine upper bounds of sets, and that requires us to 
understand when one cut is less than another one. We say that a cut (A, B) is less 


than or equal to a cut C, D) if a < d for every a € A andd € D. We say that an 
element x in the ordered field F' is less than or equal to an element y if y — = is 
either positive or 0. 


Let x and y be elements of F’, and suppose z is the equivalence class of the cut 
(A, B() and y is the equivalence class of the cut (C', D). Then x < yif and only if 
(A, B) < (C,D). 


We have that z < y if and only if the element y — x = y+ —z is positive or 0. 
Writing, as before, (A’, B’) for the cut —(A, B), we have that y — z is the 
equivalence class of the cut (Cd) — (A, B) = (C, D) + (A’, B’), so we need to 
determine when the cut (G, H) = (C, D) + (A’, B’) is a positive cut or the 0 cut; 
which is the case when the set H only contains nonnegative numbers. By definition 
of addition, the set H contains all numbers of the form h = d+ b’ forsome d € D 
and some b’ € B’. Since B’ = —A, this means that H consists of all elements of the 
form h = d—a forsomed € D anda € A. Now these numbers hf are all greater 
than or equal to 0 if and only if each a € A is less than or equal to each d € D, i.e., 
if and only if (A, B) < (C, D). This proves the theorem. 


We are now ready to present the first of the two main theorems of this appendix, that 
is [link] in [link]. 


There exists a complete ordered field. Indeed, the Dedekind field F’ is a complete 
ordered field. 


Let S be a nonempty subset of F’, and suppose that there exists an upper bound for 
S;i.e., an element M of F' such that x < M for all x € S. Write (A, B) for a cut 
such that M is the equivalence class of (A, B). We must show that there exists a 
least upper bound for S. 


For each x € S, let (A,, B,) be a Dedekind cut for which z is the equivalence class 
of (A,, B,), and note that a, < b for all a, € A, and all b € B. Let Ao be the 
union of all the sets A, for x € S. Let Bo be the set of all rational numbers r for 
which r > ao for every ag € Apo. we claim first that the pair (Ao, Bo) is a Dedekind 
cut. Both sets are nonempty; Ao because it is the union of nonempty sets, and Bo 
because it contains all the elements of the nonempty set B. Clearly Condition (2) for 
a cut holds from the very definition of this pair. To see Condition (1), let r be a 
rational number that is not in Bo. We must show that it is in Ap. Now, since r is not 
in Bo, there must exist some ag € Ao for which r < ao. But ag € Uzeg Az, so that 
there must exist an x € S such that ap € Az, and hence r is also in A,. But then 

r € Ag, and this proves that (Ag, Bo) is a Dedekind cut. 


Let Mp be the equivalence class determined by the cut (Ao, Bo). Since each 

A, C Ao, we see that a, < bo for every a, € A, and every bp € Bo. Hence, 
(A,,B,) < (Ao, Bo) for every x € S, and therefore, by Theorem A.4, x < Mp for 
all x € S. This shows that Mp is an upper bound for S. 


Finally, suppose M’ is another upper bound for S, and let (A’, B’) be a cut for 
which M’ is the equivalence class of (A’, B’). Then az < 0’ for every az € A, and 
every b’ € B’, implying that ag < b’ for every ag € Ao and every b’ € B’. 
Therefore, (Ao, Bo) < (A’, B’), implying that Mp < M’. This shows that Mp is 
the least upper bound for S,, and the theorem is proved. 


We come now to the second major theorem of this appendix, i.e., [link] of [link]. 
This one asserts the uniqueness, up to isomorphism, of complete ordered fields. 


Let F bea complete ordered field. Then there exists an isomorphism of F onto the 


Dedekind field F’. That is, there exists a one-to-one function J : F — F that is onto 
all of F’, and that satisfies 


1. J(x +y) = J(x) + Jy). 
2. J(xy) = J(x)J(y). 


3. Ife > 0, then J(x) > 0. 


We know from [link] that, inside any ordered field, there is a subset that is 
isomorphic to the field Q of rational numbers. We will therefore identify this special 


subset of F with Q. 


If x is an element of F', let A, = {r €Q:r <x} andlet B, = {r€Q:r> a}. 
We claim first that the pair (A,, B,,) is a Dedekind cut. Indeed, from the definition 
of A, and B,, we see that Condition (2), i.e., that each az € A, is less than or equal 
to each b, € By, holds. To see that Condition (1) also holds, let r be a rational 


number in F. Then, because F is an ordered field, either r < xz orr > 2, i.e., 
r€ A, orr € By. Hence, (A,, B,) is a Dedekind cut. 


We define a function J from F into F by setting J(a) equal to the equivalence class 
determined by the cut (A,, B,). We must check several things. 


First of all, J is one-to-one. Indeed, let x and y be elements of F that are not equal. 
Assume without loss of generality that 2 < y. Then, according to [link], which is a 
theorem about complete ordered fields and hence applicable to F’,, there exist two 

rational numbers r; and rg such that x < ry < rg < y, which implies that r; € B, 


and rg € Ay. Since rg > 11, the cut (A,, B,) is not equivalent to the cut (Az, B;), 
and therefore J(x) A J(y). 


Next, we claim that the function J is onto all of the Dedekind field F’. Indeed, let z 
be an element of F’, and let (A, B) be a Dedekind cut for which z is the equivalence 
class determined by (A, B). Think of A as a subset of the complete ordered field BF. 
Then A is nonempty and is bounded above. In fact, every element of B is an upper 
bound of A. Let 2 =sup A. (Here is another place where we are using the 
completeness of the field f.) We claim that the cut (A, B) is equivalent to the cut 
(A,, Bz), which will imply that J(x) = z. Thus, if a, € A,, thena, < a, and 

x < b for every b € B, because z is the least upper bound of A. Similarly, if a € A, 
then a < az, and x < b, for every b, € B,. This proves that the cuts (A, B) and 
(A,, B,) are equivalent, as desired. 


If x and y are elements of F’, and b, € B, and b, € By, then b, > x and by > y, so 
that b; + 6, > x + y, and therefore b, + b, € Bz+, for every 6, € B, and 

b, € B,. On the other hand, ifr € Bz+,, thenr > x + y. Therefore, r — x > y, 
implying, again by [link], that there exists an element b, € B, such that 

y <b, <r —«. But then r — 6, > x, which means that r — b, = 6b; for some 

b, € Bz. So, r = bz + by, and this shows that B,,, = b; + B,. It follows from 
this that the cuts (A;4,, Bz+,) and (A,;, B,) + (A,, By) are equal, and therefore 
J(xz + y) = J(x) + J(y). A consequence of this is that J(—x) = —J(z) for all 


ceF. 


If x and y are two positive elements of F then an argument just like the one in the 
preceding paragraph shows that J(xy) = J(x)J(y). Then, since J(—x) = —J(z), 
the fact that J(ay) = J(x)J(y) forall x, y € F follows. 


Finally, if x is a positive element of F, then the set A, must contain some positive 
rationals, and hence the cut (A,, B,.) is a positive cut, implying that J(x) > 0. 


We have verified all the requirements for an isomorphism between the two fields F 
and F’, and the theorem is proved. 


